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CORRECTION FOR THE MOMENTS OF A 
FREQUENCY DISTRIBUTION IN 


TWO VARIABLES! 


By 


WILLIAM DoweELLt BATEN 
University of Michigan 


In certain statistical problems it is beneficial to divide the 
given data into classes or groups and investigate the distribution 
in this form. The moments determined for the distribution div- 
ided into classes differ from the moments determined from the 
original data. It is the object of this article to show how to 
modify the former to secure the latter for a frequency distribu- 
tion in two variables. 

After the data, given for a frequency distribution of one 
variable, haye been divided into classes the class mark is then 
the representive of the items in a class. This is assuming that 
the mean of the items falling in a class is equal to the class mark. 
For a large number of items in a class, distributed throughout 
the entire class, the class mark differs very little from the average 
of the items in the class. But the average of the items raised 
to a power is not equal to the class mark raised to the same 
power. Hence corrections should be made to the moments de- 
termined from a distribution which is divided into classes. 

For a distribution of two variables a and y the data are 
divided into xy -classes, where the class mark of an x -class 


'Presented to the American Mathematical Society, Sept. 12, 1930. 
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is considered to be the representive of the items falling in this 
class, while the class mark of a y -class is the representive of 
all items in this particular class. The coordinates of the point 
in the acy -plane, whose abscissa is the class mark of the w -class 
and whose ordinate is the class mark of -the y -class, may be 
considered to be the class mark of the double class or the acy - 
class. 

Let the frequencies of the distribution be represented by 
the volumes of the volume-compartments as shown jn the figure. 
The sum of all such compartments is the total of the frequencies 
and should be equal to the number of items in the distribution. 
The little solid HMWQI-SPRDis the frequency of the items 
falling in the Sth & -class and in the 3rd y -class) OT and OF 
are the class marks of this x-class and this y -class. (o7)" (oF)”” 
multiplied by the frequency of the items falling in this double 
xy-class may differ considerably from the sum (OC. OK nd 
+(OA)"(OG)™+. . |, hence corrections must be made to 
the moments obtained from the distribution divided into classes 
where the double class marks are the representives of the items 
in the class. If the class units are made smaller and are allowed 
to become very near to zero the errors are not so large, for it 
must be remembered that our results are only approximations. 

By definition the 7 ’th, 77’th moment of the distribution 
which is divided into classes 1s 


‘ 


Va: a" 


cm "LL tehyetan dk, 


where (2; , y, ) is considered to be the class mark of the 
Se J -class, wall the double summation extends over all the classes. 
It is further assumed. that f (x; +h, Y; + kK ) is such a func- 
tion which can be expanded into a Taylor series. The above 
becomes 
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Now use the Euler-Maclaurin Summation* formula for two 


variables for finding the value of this double summation. This 
formula is 


*This formula is developed on pages 317-319. 
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which is the double summation of the function U (x,y) from 
a to & on the x-axis and from c to d along the y -axis. 
Applying this formula to the double summation above 
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t is an even number. In obtaining this result it was assumed 
that f(x,y), f(xy), x"yFlxy), x«*yF' (x,y) 
vanish or become negligible at the limits on the 2 and y axes, 
k and w are positive integers. 
Therefore 
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If 777=0 the formula becomes the formula for obtaining the 
moments about a fixed origin for one variabie. This has been 
done by Sheppard and Carver. 

If 1 and 77 take on integral values 
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In making corrections for the double moments it must be 
remembered to correct the single moments of the x’s and the y’s.* 


EULER-MACLAURIN SUMMATION FOR Two VARIABLES 


Suppose it is possible to find a function g (%,y) such 
that 9 (xt) Y+/)-G(X+1, Y)-G( X,Y +!) + (ZY) = 
f(x,y), or AB 9(x%,y) =f (x,y) or AX ' tay 
=9(X,Y), where A represents finite difference and r 


represents finite integration. If g (2,y) is such a function, 
then 


glarl,c+!)-g(a+l, d-9(a ,ct)+9a , D=Fla , a, 


g(a+2,c+I)-g9@tZ2 c)-gGlatlc+l)+glarl, c)=Flatl, o), 
g(arl,c+2)-g(atlc+l)-gla ,c+2)+9(a ,c+lI)=Fla c+) 


9(a+2,c+2)-9(ar2, c+/)-9a+l, cr) +9(a+l,c+/)= fla+l c+l), 


*See Frequency Curves by H. C. Carver in Handbook of Math. Statistics. 
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If it is pssible to find the function g9(<,y) then the 
double sum 2 z f(a, y ) can be found. Expand g(x +/, y+!) 


in a Taylor ‘series. 
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where the D)’s are operators operating on the function g (2, y ). 
Therefore 
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where the operators are now operating upon the function 


F(x,y). 
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To develop (e~A)le =) into a Taylor series it 


is necessary to develop (ea y - - into a Taylor 
series and then divide by uv. This becomes after @ » @ replace 
u and v respectively, 
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THE STANDARD ERROR OF A MULTIPLE 


REGRESSION EQUATION! 


By 


Joun Rice MINER 


Since a multiple regression equation is essentially a hyper- 
plane, fitted by the method of least squares, its standard error 
may be obtained from Gauss’ standard error of a function recently 
discussed by Schultz (1930). Let the equation be 


--+8 
Jm.23---€m-1)*m 
where w, is the dependent variable, 2, 4, -- ++ %,,the in- 


dependent variables, each measured from its respective mean, and 


©,0.98 - at? * 7 


sion coefficients. 
(10) becomes 
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1m.23++-(m-1 the partial regres- 
Then the determinant of Schultz’s equation 
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1From the Department of Biology of the School of Hygiene and Public 
Health of the Johns Hopkins University. 
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Let A ,; be the cofactor of the element in the ¢ ’th row 
and the y ’th column. Then 


[«.]= s 


[44 ]= 2,,/0 -A ,,/702h , 
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[~f]= 22/0 =9, 
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Therefore, substituting these values in Schultz’s equation 
(27.1), we have 


- 77) 





322 


ERROR OF A MULTIPLE REGRESSION EQUATION 


2 
“i nee (23-- am) 


(2) 


as 2a" +- ae. 


Se vol 
2 Xn 
és 


For a simple regression equation this reduces to 
O, ( 2/3 af 1é 
0, = a /-7 /+ =, 


This agrees with the expression given by Pearson (1913), 
if we remember that 2c, is measured from its mean and that 
Pearson does not correct for the number of parameters 

For a regression equation with two independent variables 


Og, 


 — (-e7 @ 
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As an example of the application of this formula we may 
calculate the standard error of the mean heart-weight (X,) of 


; 
the array of persons with an age (X,) of 52.92 years and a 
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body-weight (X3) of 49.93 kilograms in a population of 213 
persons characterized by the following biometric constants: 
348.9 g; ao, =794 g; Ne =+0.114 
59.65 yrs.; > = 17.54 yrs.; ly,3=* 0.652 
56.45 kg; “* 14.38 kg; 557 —0.185 . 


From these data 4, 4 = + 0.315 and r,,, = + 0.689 
and the regression equation of heart-weight on age and body- 


weight is 


X, = 66.09+/,1/00X, + 3.848%, 


from which the mean heart-weight of persons aged 52.92 years 
and weighing 49.93 kg. is 316.4 g. 

Substituting the appropriate values of the constants in (4) 
and remembering that w, = X, -%,=-6.72, «,=X,- M, 
=- 6.52 , and 


(tO 8 = (1-75 )# a rz )# 


— 993) (0 res){r Cony + aoe y 
g,= ——z a comma ¢- stdmalianaaluesmeen 
f 2102 (0 ' 07.54)" (0.966) (14.38 )° (0,966) 


_ 2(-0.185)(- a) = 47 
(7.54) (14.38) (0.966) “8° 
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SAMPLING IN THE CASE OF CORRELATED 
OBSERVATIONS 


By 
Cecit C. CRraic 


National Research Fellow 


Dr. E. C. Rhodes, in a paper in the Journal of the Royal 
Statistical Society, has considered the distribution of character- 
istics of samples of NV when the individual observations are not 
assumed to be independent. As he points out, there are many 
important cases in which the usual assumption of independence 
or randomness in the observations is not justifiable. In the pres- 
ent paper will be explained a method based on the semi-invariants 
of Thiele for the calculation of the characteristics of the sought 
distributions in this case which is especially to be preferred to the 
method based on moments when it is supposed that the observa- 
tions are normally correlated. In the case it is further assumed 
that only consecutive observations are correlated, in addition to 
Dr. Rhodes’ results, the third semi-invariant (which is the same 
as the third moment about the mean) of the variance and the 
mean and the variance of the third and fourth moments about 
the mean are given. 

Let the N observations composing a sample be given by val- 
ues of x,, x2, ---,.x,respectively. and let Fy (x,,2,-- ty) 
be the 77 -way probability function of w,,2,, -- - 


1The Precision of Means and Standard Deviations When the Individual 
Errors Are Correlated, Vol. 90 (1927), pp. 135-143. 
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Then the semi-invariants, 


-of x x, 
defined by 


rst ae 


5 Bat 48 (Eat)? +§ (Bat) 


[em (x, x, ghee? 


-+,7 eo 


which is to be regarded as a formal identity in ¢,, ¢,,--.t,. 
(2 fa. t;) () is first expanded by the nniibomalel ‘on and 
then cou term Ar AS AS --+in the result is replaced by 
rest 

We shall pass over the characteristics of distributions of 
means, since the method of semi-invariants is equivalent to that 
of moments in this case, ‘and take up the distribution of moments 
about the mean in samples of N. Following the method pre- 


viously used by the author in the case of independent observa- 
tions,” let 


N 
=2,a;;x; with 
$1 sj Ss 


Then let V ( 6. &. _ ” be the probability function of 
6, bys °*>s Sunedd ie = 0. The semi-invariants Nest * 
of 4, a ** * % 6,,-, are defined by 


1Following Cramér, I distinguish between probability and frequency func- 


tions. Fy, (ac,, Lo, °--, S,)is the “cumulative” frequency function 


and thus the integral is an 7-way Stieltjes integral. 


2An Application of Thiele’s Semi-invariants to the Sampling Problem; 
Metron, Vol. 7, No. 4 (1928), pp. 3-75. 
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and as the author has previously remarked,! we can also write 
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so long as the relation is only used to find the values of d'. ae? 
in which at least one of the subscripts is zero. 

Then S, (v,) , the & ’th semi-invariant of the 77 ’th 
moment about the mean in samples of /V, is given by the formula 


Sy (Yq )= 
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loc. cit., pp. 18, 19. 
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tie uvtation V, referring to moments of 6 
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In particular: 
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Ou writing out the moments Voy w in terms of the semi 
° . ‘ ° ‘8 . 
invariants A psi = and then using (4+) the sought semi-invariants 
aie obtained , 

In the case that the MW observations are normally correlated 
aud Fy (x,, 2, °°*,,)is the N -dimensional normal prob- 
ability function, the k it-hand member of (4) vanishes for K 2 3. 
are all equal (which we shall always do) and take as the simplest 


Lf we suppose that the standard deviations of X,, % 


case that XZ, , Ze. “+, 2, are worimially correlated ana that 


1Sec the author’s paper cited, p. 21, tocmula (25) 
*For a detailed explanation of this kind of calculation see the author’s paper 
cited, pp. 23-27. 
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the correlation as measured by the Pearsonian coefficient, r, ). , 
. ; “ij 
is the same for each pair, 2, , x; of the set of MW observations, 
we get 

N= N= XM 


ot » ft. 
20° “oz0 “oo20 = = = ~ WwW (ryp- Au)* nN (7 TAL, 







‘ , ‘ £ 7 
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if the common value of Fr, ,. be denoted simply by r. But 
4 

if the observations are independent and the parent population is 

normal we have 





sy! _ N-! 
r20 = Novo = Acoso = °° = N Aug s 
a a es -x a <a 
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Thus it follows that the distributions of the characteristics 
of samples of NM in this particular case af dependent olserva- 
tions are the same as if the observations were independent and 
taken from a normal population of variance ( /-r) x a. 

In case F,, (xc,, %,--,2,)is normal it is convenient to 
express the right hand members of (5) directly in terms of the 

‘ 
semi-invariants Rent, _.for 7=2, 3, 4. For that purpose we 
w 
shall adopt the following notation. Let the linear form 4, ay A; 
be denoted by A;. Then (4) becomes 


6 (Eat) = Eat 








Thus in a symbolic sense A;’s and A; ’s are equivalent. But with 
regard to the subscripts of the A terms in the expansion of the 
left member of (6) we use a different convention than for the 
subscripts of the A’s. We set 


me. 


See the author, loc. cit., p. 19. 
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o- A... 


/ 
5, (V.)= HZA;;» 

/ 2 er. oe ; 
S,(V,)= We [GEA +2ZA,, A, t4ZAL (2A 2" lids, 


the summations, of course, running over all values of ¢ and J 
from / to N. But since 


EM. + 80a, Aj, (EA; )* 
the second relation reduces to 
5,(0,)= ge (EAZ, +2E a5). 
Similarly 
5,(V,)=F5 (EA; +3ZA, A; OLA; Ary Ajg ), 
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2 2 
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9, (V,)=0, 

3 3 3 
5, (V,)= y2(5SZA;, +62ZA;; A, A,; +4ZA,; ), 
S,(V,)=9, 
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ee >? 


S, (V,)= HZAF 


46 a 2 2 
5,(Va)= te (ZEAS, o3ZA; Ay AY, HEAR, ). 


To illustrate the use of these formulas and to give some 
results in a case of practical interest, let us suppose that the set 
of N observations composing a sample may be assigned an order 
in which only consecutive observations are correlated and in a 
constant degree. Thus our observations might be prices or indices 
taken at the ends of consecutive time intervals. We suppose, then, 
that 


= 2X = 
ool o1o/ 


The first step in the calculation is to obtain the values of the 
various A's which entcr into the formulas (7). A,, is found 
. 2 Pa ‘ ' 7 
iron Ar, Aj,yfrom A, A,and so on. We get 


* a. 2 
Bay Pigs, (i~£- $3) A, . 


4+2r 
Aig, “Pigg ~s = (1-H 


N-1, N-t 


‘ a. 
Ai2 ce “tr N Ne ) A 20 
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Then, on substitution in (7), we have finally 


20’ 


S,W,) = 5[U-A- F +2r 2-H + Se + Hs)] D, 


S,(V,)= (1-4 Ye SE A 


These two results are given by Dr. Rhodes, loc. cit., though 
there is a slight misprint in the second one as given there. The 
remainder of the results given here are believed to be new. 


5,(V,)= <3 [ OHM Hor 20-3 +82 +25)- 4° (2-2-2 SNK, 


5, (V,)=0 , 
5,(u,)= 2[G-4ueZr0-$F)- SF (1-§ + 25 )+ 2 1- Z + Se 


gs ed 4 = A 
No Wt 20” 


S,(V,)=0 , 


S,(v,)-3|0- a 0-4 )4 4% (I a) Az, 


5-(vy)= 34 [=f ne-2 + SaX1- 82) Ortll- fe He- Bhs th eB 


Br? 19,33 , 30 54 168 9,44 64 
Nt Net Ne wt WSIteOU it Ne” v8 


W4 192 , 360 + 288 | ag 
~ eee 
n* n> N 6 =) 
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It should be observed that the expressions for 5 (v,,) for 
N <3 and for S, (V,,), k 7 @ for N<5 are in general not valid, 
since it can be seen by reference to (8) that all the types of A ’s 
used in the formulas (7) do not exist for values of NW so small. 
But for these small values of N , the values of the characteristics 
for which expressions are given above can be readily computed 
directly. 


Stanford University. 















THE RELATION BETWEEN THE MEANS 
AND VARIANCES, MEANS SQUARED AND 
VARIANCES IN SAMPLES FROM COMBINA- 
TIONS OF NORMAL POPULATIONS 





By 


G. A. BAKER 


















The distributions of the means and variances of samples 
from the combinations of normal populations have been discussed 
in a previous paper.’ It is known that if the sampled population 
is not normal the means and variances of samples are not 
independent. 

The present discussion aims to give some idea of the relation 
between the means and the variances, means squared and vari- 
ances of samples from a population that is the combination of 
normal populations. To this end the case of samples of two from 
such populations is rather completely investigated. Also empir- 
ical random sampling results for two special populations are pre- 
sented. 


Suppose that a population is represented by 


-Lac2 ad -777 - 
(1) F (x)= jie — +A e $y | 








— 


“Random Sampling from Non-Homogeneous Populations,” Metron, Vol. 
VIII, No. 3 (1930), pp. 1-21. 
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If a method used by Karl Pearson! is followed, the prob- 
ability - of 
x, in dx, is f(x, dx, , 


.%,in dais F(x) da, 


and the probability of the concurrence of these two events is 


(2) Fla Jtl(x dx, dx, 


which may be written 


é ' 
whip fe intel ee -deallar-mirtz,-mi 


(3) 


2, (£,~ 1m)? 
,* fe tle. ’ ae 
o 


= 3 (L,+4,) 
27:4 | (x, -£)*+ (x,-x)? | 


Whence 


fe 2, + 


a2 E. Oa - 


@a, Ax, may be replaced? by 
1Appendix to Papers by “Student” and R. A. Fisher, Sionietrika, \ ol. ALIX 
(1925), p. 522. 
2R. A. Fisher: “Frequency Distribution of the Values of the Correiation 


Coefficient in Samples from an Indefinitely Large Population,” Biometrika, 
Vol. X (1915), p. 507. 
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(5) cdiidx 


In virtue of (5), (4) and (3), (6) is obtained. 


/ | -£ex*s2z Ta? i -Sg2[2z 4, 20x: m)?| 
(1+k)}*2i7 - 


2 
cong fo BC tre gem 
og 


2 
ef Cag Hi] 


This is the correlation surface for the means and standard 
deviations of samples of two drawn from (1). To get the corre- 
lation surface of the means and variances write 


Then 


_ blex*eu “a -£,,[2ur2/x m)| 
E "2laae © 


F(t, u)= 7 Eten 


é 
k 7, [Car + x)*+ (va = m)") 
+ Dla 
2 
eo Sl-a+xte (Gex. m) H 


* ava 
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is the desired surface. 
The locus of mean w’s for given x’s is 











- (at-m)* _ a(x- 2)* . 2 
_2 2 (o2-1).+m)} 
a 2 oe ,4V2k 2 {(e?-).x+m ] 
e +kaoe + eC aos lo¥ 
iin (a7 +1)# . | oe! 
(8) m7 \2 
-2(x- 2 
2 
= 


The locus of the mean x ’s for given u ’s is 


ke “9% alge tu - By “als 3); 
= ph eek [ce imiembe +! ~{e? I)fa- ne se | 
9 = tl omeeineset ineone oemt 
_ «ee af -f _2 z)* - 2(-fa- F)~ 
e Le o*, eee fe o24+/ 4¢e S24/ } 


The correlation surface for the means squared (= z ) and 
variances is 


-£[2z+2 me 32 [24 +2lfa- m)*} 
e 








e 
Hl, 2) —_ fa lz * w2 hale 
2. ela +fa)*+(- (a+ -m)*) 
(10) aa 


~4[(-fa+ te) + (as - mm) 
2 a2 


e j] . 
‘alate 
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The locus of the mean wu ’s for given 2’s is 





ve 
- la-m é -2((- >) 2 
7 Aut o2+/ {(e2/)¥e+m } 
2 
e+kaee “fg2ei) e [oz ate? ] 
re ee eee 
- (fé-m) - 2(¥f- 3) 
a vo 247 
The locus of the mean 2’s for given &’s is 
2 = 
“ 2° 7. "2(fa - BZ y\e2 -2(a + B72 
uke F2 2kek 2 
e “7 + SEK fe oes +e Fees 


multiplied by 


—— -<& 
le +£ (0% m* Je g 


alek ox ntm-mj, - 2, 2) 22° 
* Geant {(o* ———— Je gees 


m\2 
o%1) farm “ z) 
a St), 


By expanding the denominators of (8), (9), (11), and (12) 
by the multinomial theorem, it can be shown that each of these 
loci is essentially parabolic, g2/ # 0. They are subject to an 
exponential influence at the beginning of the range of the inde- 
pendent variable, which influence rapidly diminishes as the inde- 
pendent variable takes on higher values. 

The probability relations in general between means and vari- 
ances, means squared and variances will be expected to approx- 
imate those for the case of samples of two, because of the fol- 
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iowing considerations. Suppose that m (the number in the 
sample) is large? When a large proportion of the sample comes 
from the first component, the first term of (7) with 2 in the 
numerator of the exponent replaced by 7 and with u @ replaced 
by « 7 will be an approximation to the surface of the means 
and variances. Similarly, when a large proportion of the sample 
comes from the second component, the second term of (7) with 2 
in the numerator of the exponent replaced by 7 and with u © 
replaced by ce will be an approximation to the surface of 
the means and variances. When about equal proportions of the 
sample come from each component, the last term of (7) with 3 
in the numerator, of each exponent replaced by z and with u 2 

replaced by uw 2 will be an approximation to the surface of 
the means and variances. Or, all together, (7) with the 


mentioned changes in the exponents of the terms, with proper 


weighting of the terms, and with uw “ replaced by % is a 
proportionate approximation to the distribution of the means and 
variances of samples drawn from a population represented by 
(1). Further, increasing 77 will not influence relations (8), (9), 
(11), and (12) as approximations for the general case except 
the exponential term, if it is assumed that the denominators are 
expanded and then multiplied by the numerators, for * occurs to 
the same power in the numerators and denominators. 


Note: The effect of K and of the binomial coefficients is roughly as fol- 
lows. If the 27741 terms denoting S from the first component of (1) 
and 77—s from the second component are divided into thirds, then, if 
4, 4, ; 4, are the exponents of K in the middle terms, 

?- - 57+ 


= = 
or approximate ly, since 77 is large and sire only a proportionate expression 
is desired 4-9 ’ 4, 2g ? 4,2 
or the exponents of K of the middle terms of the three sections above 
are 3 times the exponents of K in (7). The effect of increasing 77 be- 
cause of the binomial coefficients is to weight the middle section of the 
possible surfaces to a much greater extent than the extreme sections, s¢ 
that with 77 very large the last term of (7) with 2 replaced by 7 
becomes an approximation to the desired surface. 
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From (8), (9), (11), and (12) it is clear that the par- 
ameters of the sampled population have great influence on the 
regression relations considered. It should be borne in mind in 
this connection that many flattened and skewed, as well as bi- 
modal, distributions can be adequately represented by combina- 
tions of normal populations. Also, results (8), (9), (11), and 
(12) can be extended to the sums and differences of any number 
of normal curves, subject to the condition that the resultant is 
always positive. 

In 1925, Dr. Neyman! gave the correlation coefficient be- 
tween the deviations of the means of samples from the mean of 
the sampled population and the variances of these samples for 
samples of 77 drawn at random from an infinite uni-variate popu- 
lation in terms of the betas of the sampled population as 


V( 77-1) 3, ~ m+ 3 


(13) p' In-T VB . 


Similarly, the correlation coefficient between the deviations 
squared of the means of samples from the mean of the sampled 
population and the variances is 


‘ In-l (48-3) 


(14) R= ae ° 
(4, +2n-3) [(7-1) 8a" 7+ 3] 


Under certain very special conditions the statement of 0’ 
and &' may give an adequate idea of the regression relation 
between the means and variances, means squared and variances 
of samples from a population represented by (1). In general 
the mere statement of these coefficients will not give any useful 


1J. Splawa-Neyman: “Contributions to the Theory of Small Samples 


Drawn from a Finite Population,’ Biometrika, Vol. XVII (1925), pp. 
472-479. 
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notion of the actual probability relations. This is true because: 
(a) the regression relations between means and variances, means 
squared and variances of samples from a population represented 
by’ (1) are essentially parabolic, as shown for samples of two 
and as seems probable for larger samples; (b) the frequency 
arrays may vary markedly in dispersion, in skewness, and in 
other characteristics. 

To illustrate these remarks, samples of four were drawn 
from two special populations by throwing dice. 
Suppose that a population is represented by 
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Whence 


2 
f- (+k)[-3m, -m,7+ k(3m,o07+ m.)] 
‘ 


16 — . 
( ) [+ 2,4 k (72+ m,* )] Ji 


g : (1+k )[3+6 mes m *+k( 30 *+ 6 mo 7+ m7,* )) 


an) [14 m,2+k(a 24 mz )\* 


Thus, for any special population of the form (15), fe’ and 
f?' can be easily calculated. 

Samples of four were drawn from a population approx- 
imately represented by 


{ -g (att. i / -$ (2-17) 
(18) 4 (2z)= 648l on é + Br e 


The actual sampled population is shown in Chart A and is 
hereinafter called Population I. 

Table I shows the distribution of 1038 samples of four drawn 
from Population I with respect to the observed values of the 
means and the variances. The arrays for constant values of the 
variances are at first distinctly bimodal, gradually becoming uni- 
modal. Chart I shows the means of arrays of Table I with the 
regression lines as calculated without correction for groupings. 
It is apparent that the locus of the mean variances for a given 
value of the means diverges a great deal from a straight line. 
This regression relation looks as though it was a normal curve, 
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which 1s what would be expected from (8) with o@/=0. The 
theoretical and actual correlation coefficients for this and three 
subsequent tables are compared in Table V and the constants of 
the marginal distributions of Tables I to IV are presented in 
Table VI. 

If the deviations of the means of the samples of Table I 
from the mean of Population I are squared, Table II results. 
Chart II shows the means of arrays and regression lines of Table 
II. The regression lines are very poor fits to the means of the 
arrays which are, apparently, exponential loci. 

Table III shows the distribution of 1058 samples of four 
drawn from a population approximately represented by 


= l 2 
1 -g(4+8) > -3(2-2.4) 
(19) f(ad= s7d gz e "3/en € | 





with respect to the observed values of the means and variances 
of the samples. The actual sampled population is presented in 
Chart B and is hereinafter called Population II. Chart III shows 
the means of arrays and regression lines of Table III. This chart 
resembles Chart I in that the locus of the mean variances for given 















values of the means is so obviously non-linear. Also, a glance at 
Table III is sufficient to see that the arrays vary markedly in 
skewness. 

Table [V shows the relation between the means squared and 
variances of samples of four from Population II. Chart IV shows 
the means of arrays and regression lines for Table IV. In this - 
case the regression relations seem to be fairly near linear, and the 


frequency distributions of the arrays do not change strikingly. 
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CHART I 


The Means of Arrays and Regression Lines of the Means and 
Variances of Samples from Population J 
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CHART II 


The Means of Arrays and Regression Lines of the Means 
Squared and Variances of Samples from Population I 
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NOTE: The last thirteen class intervals of the means squared are grouped 
into one group. 
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CHART III 
The Means of Arrays and Regression Lines of the Means and 


Variances of Samples from Population II 
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CHART IV 


The Means of Arrays and Regression Lines of the Means Squared 
and Variances of Samples from Population II 
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TABLE V 


Correlation Coefficients of Tables I-IV 


| Correlation-Coefficient 





TABLE VI 


Constants of the Marginal Distributions of Tables I-IV 
in Terms of Class Intervals 







Standard 
Deviation 









Marginal Distribution 





Means of Samples from Population I 


Variances of Samples from Population I 2.900 
Means Squared of Samples from Population I 3.203 
Means of Samples from Population II 2.237 
Variances of Samples from Population IT 2.854 


Means Squared of Samples from Population II 


1Calculated without corrections for grouping. 

“Is so far from zero because of the groupings employed. Many means were 
exactly odd integers. These were all put forward into higher classes, mak- 
ing the calculated mean too large. 

4Origin taken at the beginning of the range. 
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From the results for the case of samples of two and from 
the results of empirical sampling, it seems clear that the simplest 
regression relation that is generally applicable to the means and 
variances, means squared and variances, of samples from popu- 
lations which are the combinations of normal populations is para- 
bolic. For small samples and for certain values of the parameters 
of the sampled population the regression relations may involve 
exponential terms that are quite important. As the size of the 
samples increases, it is expected that this exponential term will 
decrease in influence. It seems plausible that even with large 
samples the regression relation of means and variances, means 
squared and variances will remain essentially parabolic. It is not 
expected that the determination of a good approximation to the 
regression relations will serve to give an adequate notion of the 
probability relations of the means and variances, means squared 
and variances of samples from a population represented by (1), 
because the arrays may vary in number of modes, in skewness, 
in dispersion, and in other characteristics. For instance, surface 
(7) may be trimodal so that arrays may be bimodal or unimodal, 
and in such a case the arrays must vary markedly. Surfaces 
(7) and (10) with 2 replaced by 77 and with the terms suit&ly 
weighted are valuable approximations to the probability relations 
of the means and variances, means squared and variances of 
samples drawn from a population represented by (1). 


DA. 6ohe— 





A TABLE TO FACILITATE THE FITTING OF 


CERTAIN LOGISTIC CURVES 


By 


JosHua L. BalLey, Jr. 


Tne most useful generalization of the logistic curve is that 
having the form 


a 
(1) gY* [tet t Ox tcuteguy.-.. 


In practice it will seldom be found necessary to use higher 
powers of z . This equation may also be written 


(2) Y=a+bx+cx*+ox? 


in which Y= log af. 

If we can evaluate the constant K with reasonable accuracy, 
the value of Y corresponding to each observed value of y can 
be computed, and then the values of the coefficients @, & c, and 
g , in equation (1) may be obtained by fitting equation (2) ‘as 
a generalized parabola by the method of least squares. 

The normal equations necessary to make this fit will be found 
to be 


ada” +biwc +¢Fae* +g Ea* «ZY 


eta «+ &6fa* +eoLxr +g Zx* «Laxey 


alia* + 622* ela” «ag x* 


Zx*y 


@Zx2 + bLx* +eTx + gFun® =O xy. 


“ 
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In the special case where the observations have been made 
at regular intervals (that is, where the successive values of x are 
in arithmetic progression) the solution of these normal equations 
may be greatly simplified. We may then select an arbitrary origin 
in the middle of the range of observations, so that for every 
positive value of x there will be a corresponding negative value 
of eqitat-absolute magnitude. Thus the sums of the odd powers 
of .x will all be zero. 

If the number of observations be odd, the middle one will, 
of course, be chosen for the origin, and the unit of the scale will 
be the interval between successive values of x . If the number 
of observations be even, the origin will be midway between the 
middle pair of observations, and it will be found more convenient 
to take half the interval as scale unit. In the former case, x 
will take all integral values between + 77 and - 7 , while in the 
latter case 2 may take only the odd integral values. 

If we set the sums of the odd powers of x in the normal 
equations equal to zero, and solve them simultaneously, we de- 
rive the following formulae for the literal coefficients : 


An ZVEX*ZXPRY-EX?, _ EXEV-EX* ZY EX* 


ZX*EX°- (LX? )? EX* EX (EX 7)? 
peZXVEXS ER VEX" 6 _EK*VEXZEXY EX” 
ZX * 5 x*-(Zx*)? ZX*- 2 X*-(F x*)* 


The use of capital letters indicates that the equation has been re- 
ferred to the arbitrary origin. 


In these formulae the factors involving Y must be com- 
puted from the observations, but those in which X alone occurs 
may be tabulated for all convenient values of 7. Since Y does 
not occur in the denominators at all, these may be tabulated in 
the same way. 
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Finally, the sign of G is determined by the direction in which 
the curve approaches the asymptote gw=0, and this may readily 
be told by inspection. ut it not imtrequentiy happens that 2 
. slight error in one of the observations may be sufficient to give 
G the wrong sign. In this case the limits between which the 
observations were taken must be changed, or a new value of k 
must be tried, or the faulty observation must be adjusted by a 
smoothing formula. It is obviously important therefore that some 
means be provided for determining the sign of G before the 
values of the coefficients are determined. 3 a 

The conditien that G shall be negative is-2X >=, 
The second term in this inequality may be tabulated in the same 
way. The accompanying tables show the values of the functions 


eu’ ou? 2a; Ex" oxen -any 
ZX*EX* (Lx) and EX *+ OX? 


for all values of 77 from 0 to 25 when the number of observations 
is odd and from 0 to 49 when they are even. 

In the preparation of these tables, my thanks are due to the 
Zoological Society of San Diego for the use of the facilities 
afforded by its research department. 


Yeohun £ Paitey be. 






























THE GENERALIZATION OF 
STUDENT’S RATIO#* 


By 


Haroitp HOtTeELLING 





The accuracy of an estimate of a normally distributed quan- 
tity is judged by reference to its variance, or rather. to an 
estimate of the variance based on the available sample. In 1908 
“Student” examined the ratio of the mean to the standard devia- 
tion of a sample.1 The distribution at which he arrived was 
obtained in a more rigorous manner in 1925 by R. A. Fisher,° 
who at the same time showed how to extend the application of 
the distribution beyond the problem of the significance of means. 
which had been its original object, and applied’ it to examine 
regression coefficients and other quantities obtained by least 
squares, testing not only the deviation of a statistic from a hypo- 
thetical value but also the difference between two statistics. 

Let E be any linear function of normally and independently 
distributed observations of equal variance, and let s_ be the es- 
timate of the standard error of & derived by the method of 
maximum likelihood. If we let t be the ratio to § of the devia- 
tion of & from its mathematical expectation, Fisher’s result is 
that the probability that ¢ lies between f and 7, is | 


*Presented at the meeting of the American Mathematical Society at Berk- 
eley, April 11, 1931. 

1Biometrika, vol. 6 (1908), p. 1. 

2Applications of Student’s Distribution, Metron, vol. 5 (1925). p. 90. 
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where 77 is the number of degrees of freedom involved in the 
estimate $s. 

It is easy to see how this result may be extended to cases in 
which the variances of the observations are not equal but have 
known ratios and in which, instead of independence among the 
observations, we have a known system of intercorrelations. In- 
deed, we have only to replace the observations by a set of linear 
functions of them which are independently distributed with equal 
variance. By way of further extension beyond the cases dis- 
cussed by Fisher, it may be remarked that the estimate of vari- 
ance s* may be based on a body of data not involved in the 
calculation of & . Thus the accuracy of a physical measurement 
may be estimated by means of the dispersion among similar 
measurements on a different quantity. 

A generalization of quite a different order is needed to test 
the simultaneous deviations of several quantities. This problem 
was raised by Karl Pearson in connection with the determination 
whether two groups of individuals do or do not belong to the 
same race, measurements of a number of organs or characters 
having been obtained for all the individuals. Several “coefficients 
of racial likeness” have been suggested by Pearson and by 
V. Romanovsky with a view to such biological uses. Romanovsky 
has made a careful study! of the sampling distributions, assuming 
in each case that the variates are independently and normally 


1V. Romanovsky, On the criteria that two given samples belong to the same 

normal population (on the different coefficients of racial likeness), Metron, 
vol. 7 (1928), no. 3, pp. 3-46; K. Pearson, On the coefficient of racial 
likeness, Biometrika, vol. 18 (1926), pp. 105-118. 
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distributed. One of Romanovsky’s most important results is 
the exact sampling distribution of 1. , a constant multiple of the 
sum of the squares of the values of t for the different variates. 
This distribution function is given by a somewhat complex in- 


finite series. For large samples and numerous variates it slowly 
approximates to the normal form; for 500 individuals, Roman- 
ovsky considers that an adequate approach to normality requires 
that no fewer than 62 characters be measured in each individual. 
When it is remembered that all these characters must be entirely 
independent, and that it is usually hard to find as many as three 
independent characters, the difficulties in application will be ap- 
parent. To avoid these troubles, Romanovsky proposes a new 
coefficient of racial likeness, H , the average of the ratios of 
variances in the two samples for the several characters. He ob- 
tains the exact distribution of H , again as an infinite series, 
though it approaches normality more rapidly than the distribution 
of L. . But H does not satisfy the need for a comparison between 
magnitudes of characters, since it concerns only their variabilities. 
' Joint comparisons of correlated variates, and variates of un- 
known correlations and standard deviations, are required not only 
for biologic purposes, but in a great variety of subjects. The 
eclipse and comparison star plates used in testing the Einstein 
deflection of light show deviations in right ascension and in declin- 
ation; an exact calculation of probability combining the two least- 
square solutions is desirable. The comparison of the prices of a 
list of commodities at two times, with a view to discovering 
whether the changes are more than can reasonably be ascribed 
to ordinary fluctuation, is a problem dealt with only very crudely 
by means of index numbers, and is one of many examples of the. 
need for such a coefficient as is now proposed. We shall gener- 
alize Student’s distribution to take account of such cases. 
We consider p variates , , Lore so Lp , each of 
which is measured for N individuals, and denote by Xj, the 
value of x; for the « th individual. Taking first the problem 
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of the significance of the deviations from a hypothetical set of 
mean values /77,, 72, + + Mp, We calculate the means 
My sdge + + 4 Xp oi the samples, and put 

» Xp: 


§; =(, -m,;) YUN . 


[hen tae co. vaiues of the gE j will all be zero, and the vari- 
ances anc covar.ances will be the same as for the corresponding 
Xj; ; since the individuals are supposed chosen independently from 
an infinite population. In order to estimate them with the help 
of the deviations 


x, * Rin — &; 


irom the respective means, we call 7 =.N - 1 the number of 
jegrees of ireedom and take as the estimates of the variances and 
covariances, 


‘Mea * asd i. the sense oi mathematica! expectation: the 
1a y ¥ - mean value is zero is defined as the expecta- 
tion of ‘ the covariance of two such quantities is the expectation 
of heic ,.ocuc.. Thus the correlation of the two in a hypothetical infinite 


population is the ratio of their covariance to the geometric mean of the 
variances. 
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_a.... cofactor of a2;; ina 
(3) Ajj = Aji ©  —nmnggonlmmnene 
a 
The measure of slaadaensons deviations which we shall em- 
ploy is 


PP 
(4) T°32E Ay EiG;. 


d=/ fel 


For a single variate it is natural to take A,, = 4, ; then 
T reduces to ¢ , the ordinary “critical ratio” of a deviation in a 
mean to its estimated standard error, a ratio which has “‘Student’s 
distribution,” (1). For examining the deviations from zero of 
two variates x andy, 


/ N-/ 2 N-/ 


2 (X-z)(Y-g) 
v* 2 = ie 
V2Zi(X-x) Z(Y-9) 


For comparing the means of two samples, one of JV, and 
the other of JV, individuals, we distinguish symbols pertaining 
to the second sample by primes, and write 
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(6) _ as 2 Oia FX ju, %) +E(X;., x. Xx a 


» aA 


“AEX, XN 


ia ju 


and take as our “coefficients of racial likeness” the value (4) oi 
T*, in which the E ; are calculated from (5) and the A ij 


from (6) and (3). 

Other situations to which the measure 7’* of simultaneous 
deviations can be applied include comparisons of regression co- 
efficients and slopes of lines of secular trend, comparisons which 
for single variates have been explained by R. A. Fisher. In 
each case we deal for each variate with a linear function € i; of 
the observed values, such that the sum of the squares of the co- 
efficients is unity, so that the variance is the same as for a single 
observation, and such that the expectation of & ; is, on the hy- 
pothesis to be tested, zero. Deviations x;, of the observations 
from means, or from trend lines or other such estimates, are 
used to provide the estimated variances and covariances 4; j 
by (2). The number of degrees of freedom 7 is the difference 
between the number JV of individuals and the number g of 
independent linear relations which must be satisfied by the quan- 


1Metron, loc. cit., and Statistical Methods for Research Workers, Oliver 
and Boyd, third edition (1928). 
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tities Bij» Zar + + » Z;,y On account of their method 
of derivation. For all the variates, these relations and nm must 
‘be the same. 

The general procedure is to set up what may be called normal 
values 2X; for the respective X,. , putting 


(7) iy = Xi = 


The underlying assumption is that Pia is composed of two 
parts, of which one, €;, , is normally and independently dis- 


2 which is the same for 


tributed about zero with variance 0; 
all the observations on 2;. The other component is determined 
by the time, place, or other circumstances of the ~’th observation 
in some regular manner, the same for all the variates. Denot- 


ing this part by 7);, , we have 


Xin “Ria * Cin - 


Specifically, we take 7;, to be a linear function, with known 


coefficients g, , of q unknown parameters Cir - . +G, q 
where Qq<N: 

q 
(8) Nie “2 Jus Sis: 

$= 


Thus in dealing with a secular trend representable by a poly- 
nomial in the time, we may take tne g’s as powers of the time- 
variable, the 6 ’s as the coefficients. For differences of means, 
the g ’s are 0's and l‘s, and the G, 's the true means. 

We estimate the © ’s by minimizing 
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Substituting from (8), differentiating with respect to FF 5° and 
replacing 77; y by Xi for the minimizing value, we obtain: 


N 
(102) 2 Qus Xin Zu) *9: (s=12,--+,4) 
w= / 
or by (7), 
N 
(MM) D” ug Xin 2? G=12,°°.9) 
a =/ 


Denoting also the minimizing values of Gi by 2/5 . we 
have made from (8), 


q 
o o Gas * is 
Subtracting (8), 
_ q aad 
(12) Zin Nia 2 dus (2.5 ~ Sis) 


From (9), 


eV -Z, [ X,.-F0)* (Z;, “in )| ; 


N N 
(13) =. (Kg -Ej4) +22 Kig- BNE - 7,.) 
et =/ “~=/ 
x 2 
+h, (Bi0~ Vig) 


The —s term, by (12), equals 


eX E 955 (Xju- ZiaX Fis ~Sis)) 


=/ Ss=/ 
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this, by (10), is zero. Hence, by (7) and (13), 


U. =V. +W,, 


where 


2 Wi; <2 CZ sy,- Vian y 


If the q equations (10) be solved for XY 1 Kas 
Zin , the values of these quantities will be found to be homo- 
geneous linear functions of the observations ) Ry (7), 
therefore, the quantities 


Zip, Meo, baci 2 > Een 


are homogeneous linear functions of the Mia: But they are not 
linearly independent functions, since they are connected by th: 


g relations (11). Hence V is a quadratic form of rank 


ne N-@.. 


Since V; , by (9), is of rank NM, W is of rank 9: 


, . °,° / 
This shows that Np new quantities x 


ig » given by equa- 


tions of the form 


N 
“2,6 Mt 


N 
/ 
Xin S Kua Xig Al «8 igr(%=42,: a) 


Z 


(14) 


N ; ‘ 0 Nv 
) -- 
Lik Sup Xip Up) £, (4X4 Gaya uae (e= m4 N) 


can be found such that 
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e.» se » 
2V,~2. Zig =. Lin > 
(15) 


ZW. Ss xis 


t Ge 74/ ioe? 


and therefore 


2 
~ 


ad 
(16) 2U;*2 xi 
“=/ 


Substituting (14) in (15) and equating like coefficients, 


7 
(17) a « c 


£, “ag “ay =% 


BT 
where &gy is the Kronecker delta, equal tol if @=7 , to 
0if SAY ° 

The coefficients c a depend only on the gv, , which 
have been assumed to be the same for all the variates. Thus 
(14) may be written 


N 


wv 2) xX 


Multiplying by (14), summing with respect to ~ from 1 to 7, 
and using (17), 
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Just as in (2), we define a, in this generalized case by 


(19) aii 


Then by (18), 
(20) aij 


Of the last equation, (6) is a special case. 
The random parts €, of the observations on 2; have 
by hypothesis the distribution 


/ -U;/2 0,7 
waar * , AE,, Eg °* aE 
6” 


where V; is given by (9). From what has been shown, it is 


tN” 


clear that this may be transformed into 


«<tial” 2 ,@ 2 
(x;y - X i as 0 wah Ziv Yeo; 


/ z 
e ax,,°°- 


(a, 27)” 


showing that Zi es we x; are normally and independently 


me 


distributed with equal variance dg; 

The statistic i must be independent of the quantities 
iy 7 Tio» - + Bi entering into (20), its mean value 
must be zero, and its variance must be oe. These conditions are 
satisfied in the cases which have been mentioned, and are satis- 


fied in general if é; is a linear homogeneous function of 
‘ . . . « wi, with the sum of the squares of the 
ad 774/ tn 


coefficients equal to unity. 


The measure of simultaneous discrepancy is 


> PP 


ézh yzl 
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Ai; 
that 


being defined by (3) on the basis of (19). It is evident 


as appears when the numerator is expanded by the first row, and 
the resulting determinants by their first columns. = 

A most important property of J” is that it is an absolute 
invariant under all homogeneous linear transformations of the 


variates Ziv + 1 Lp- This may be seen most simply by 
tensor analysis; for & ; is covariant of the first order aid A; j 
is contravariant of the second order. 


The invariance of J shows that in seeking its sampling 
distribution we may, without loss of generality, assume that the 
variates Zr. + 4 Xp have, in the normal population, zero 
correlations and equal variances for they may always by a linear 
transformation be replaced by such variates. 

Let us now take 


4 / 
e2 Sys Migr -" * 
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as rectangular coordinates of a point 7 in space Vo a, Of n4/ 
dimensions. Since these quantities are normally and independently 
distributed with equal variance about zero, the probability density 
for F? has spherical symmetry about the origin. Indefinite rep- 
etition of the sampling would result in a globular cluster of rep- 
resentative points for each variate. Actually the sample in hand 
fixes the points F , i a me me ae P , which may be regarded 
as taken independently. 

We shall now show that 7 is a function of the angle 0 
between the & -axis and the flat space Vp containing the points 
a F, and the origin 0. We shall denote by A. 
the point on the & -axis of coordinates 1,0,0, . . ., O, and by 
a the flat space containing the remaining axes. Since in Trees 
one equation specifies V, and 7+41-p equations Vp. the 
intersection of V_ and VY, is specified by all these 74#2-p 
equations, and is therefore of p- 1 dimensions. Call it Vp-, 

If Pr ; F ieee e be moved about in Ve. 6 will 
not change, and neither will 7’, since J is invariant under linear 
transformations, equivalent to such motions of the F . Hence 
T always has the value which it takes if all the lines OP : OF, ‘ 

. OF, are perpendicular, with the last p-1 of these lines 
lying in Vp-;. In this case the angle AOP cquals @ 


Applying to the coordinates of A and of / the formula for 


the cosine of an angle at the origin of lines to ( Lr Bers +) 
and Cy, a .), namely, 
(22) cos oS 2 ty _ 


YZ x2*Z y? 


We obtain 


cos @ = 


/ —o 7 
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2 il 


. / 
Since 2, +--+ -- +2, = 7a,,, 


it follows that 
2 
(23) n cot* @ = Cray 


The fact that PE eas + a He lie in Vwi , and there- 


fore in Vv, , shows that in this case 


ee eee hee 


Because OF , OF s+ * OP 5 are mutually perpen- 
dicular, (20) and (22) show that @;; 


ae O whenever ¢ # / 
Hence, by (21) and (23), 


(24) T=£,/e,, = Jf cote. 


By this result the problem cf the sampling distribution of 
T is reduced to that of the angle @ betweena line OA in V,,, 
and the flat space V, containing p other lines drawn inde- 
pendently through the origin. The distribution will be unaffected 
if we suppose V,, fixed and OA drawn at random, with spher- 
ical symmetry for the points A .2_ Let us then, abandoning the 
coordinates hitherto used, take new axes of rectangular coordin- 


ates Y , Yrs + 9 Yp,, of which the first p lie in Vp , 
A unit hypersphere about O is defined in terms of the general- 


1This geometrical interpretation of 7’ shows its affinity with the multiple 
correlation coefficient, whose interpretation as the cosine of an angle of 
a random line with a V, enabled R. A. Fisher to obtain its exact 
distribution (Phil. Trans., vol. 213B, 1924, p. 91; and Proc. Roy. Soc., 
vol. 121A, 1928, p. 654). The omitted steps in Fisher’s argument may 
be supplied with the help of generalized polar coordinates as in the text. 
Other examples of the use of these coordinates in statistics have been 
given by the author in The Distribution of Correlation, Ratios Calculated 
from Random Data, Proc. Nat. Acad. Sci. vol. 11 (1925), p. 657, and 
in The Physical State of Protoplasm, Koninklijke Akademie van Weten- 
schappen te Amsterdam, verhandlingen, vol. 25 ( 1328), no. 5, pp. 28-31. 
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ized latitude-longitude parameters OF ee ae LD, if we put 


= SiN 0 sind, sing, “+: Sin Ip-4 ©O8 @, 
=COS g sing sing, = sing, ,cosf, 


cosh, sin$, Sind, cos, 


cosh: °° Sin Q,..1 cos g. 


cos, cos %, 
Ypsi* sing, cos O41 


Vaus* sing, sin $o,)°° Sin bes 


for the sum of the squares is unity. Since 


2 waco 29 
Yp+ ? Gra, = SUN Pp 
we have 


Q,-=0. 


The element of probability is proportional to the element of gen- 


eralized area, which is given by 


WDag,d¢,.-::--4G,, 


where D is an 7 -rowed determinant in which the element in 
the ¢th row and / th column is 


Qe Oy : OY 
K-1 9%; = 9; 
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For ¢#/ , this is zero. Of the diagonal elements, the first 
p-1 contain the factor cos* Po; the pth is unity; and the re- 
maining -p elements contain the factor sin @ PQ, - Since ~ 
is not otherwise involved, the element of area is the product of 


pl n-p 
cos Pro sin Pp d Pp 


by factors independent of Z, . The distribution function of 











@ is obtained by replacing “Bo by @ and integrating with 
respect to the other parameters. Since © lies between O and 
11/2, we divide by the integral between these limits and obtain 
for the frequency element, 


or (EY 


” ce oe oe ee ee 
r (8) (@B2!) 





i bs 
os” 8 sin . Pade. 


de 


Substituting from (24) we have as the distribution of T : 










2r (784 r P- ‘aT 
‘2 el aaa terttiniiniin 





For 1 this reduces to the form of Student’s distribution given 
by Fisher and tabulated in the issue of Metron cited; however, 
as J may be negative as well as positive in this case, Fisher 
omits the factor 2. 





For p=2 the distribution becomes 


ak - 


From tiis it is easy to calculate as the probability that a given 
value of JY’ will be exceeded by chance, 


(26) 
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a very convenient expression. 

The probability integral for higher values of fo may be cal- 
culated in various ways, the most direct being successive integra- 
tion by parts, giving a series of terms analogous to (26) to 
which, if p is odd, is added an integral which may be evalu- 
ated with the help of the tables of Student’s distribution. If p 
is large, this process is laborious; but other methods are available. 


The probability integral is reduced to the ‘incomplete beta 
function if we put 


awe(h+T Yn)”, 


for then the integral of (25) from T°” to infinity becomes 





P- 1, (7 / 5): 


the notation being 


B,(p,q) -[2?"(I-x) as , 


/ 
Blpg={ 2? U-2)*' dz, 


b, (pq 
I ( , = or ° 
Many methods of calculation have been discussed by H. E. Soper' 
and by V. Romanovsky.? -An extensive table of the incoinslete 
heta function being prepared under the supervision of Professor 


_.Karl Pearson has not yet been published. 


Perhaps the most generally useful method now available is 








1Tracts for Computers, no. 7 (1921). 
2On certain expansions in series of polynomials of incomplete B-functions 


(in English), Recueil Math. de la Soc. de Moscou, vol. 33 (1926), pp. 
207-229. 
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to make the substitution 


=4 /og, (n-p+l)T?- 4 /og. 7p, 


7,= P 


p= n-ptl, 


reducing (25) to a form considered by Fisher. Table V1 in his 
book, Statistical Methods for Research Workers, gives the values 
of Z which will be exceeded by chance in 5 per cent and in 1 per 
cent of cases. If the value of z obtained from the data is greater 
than that in Fisher’s table, the indication is that the deviations 
measured are real. ; 
If the variances and covariances are known a priori, they are 

to be used instead of the Bij 3 the resulting expression ~T' has 
the well known distribution of r , with p degrees of freedom. 
For very large samples the estimates of the covariances from the 
sample are sufficiently accurate to permit the use of the t dis- 
tribution for 7. This is well shown by (25), in which, as 7 
increases, the factor involving JT approaches 


p-1 -TV2 
€ 


> 


which is proportional to the frequency element for Xu when 
is put for JT. 

As Pearson pointed out, the labor of calculating :, which 
we replace by 7°, is prohibitive when forty or fifty characters 
are measured on each individual. With two, three, or four char- 
acters, however, the labor is very moderate, and the results far 
more accurate than any attainable with the Pearson coefficient. 
The great advantage of using JT is the simplicity of its distribu- 
tion, with its complete independence of any correlations among. 
the variates which may exist in the population. 


To means of a single variate it is customary to attach a 
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’ 


“probable error,” with the assumption that the difference between 
the true and calculated values is almost certainly less than a cer- 
tain multiple of the probable error. A more precise way to fol- 
low out this assymption would be to adopt some definite level of 
probability, say 2 =.05, of a greater discrepancy, and:to deter- 
mine from a table of Student’s distribution the corresponding 
value of t, which will depend on 77 ; adding and subtracting 
the product of this value of ¢ by the estimated standard error 
would give upper and lower limits between which the true values 
may with the given degree of confidence be said to lie. With T 
an exactly analogous procedure may be followed, resulting in the 
determination of an ellipse or ellipsoid centered at the point & i 
g,. i we g es Confidence corresponding to the adopted prob- 
ability may then be placed in the proposition that the set of 
true values is represented by a point within this boundary. 


ee 


—— 
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SYSTEMS OF POLYNOMIALS CONNECTED 
WITH THE CHARLIER EXPANSIONS AND 
THE PEARSON DIFFERENTIAL AND 
DIFFERENCE EQUATIONS* 


By 
EMANUEL HENRY HILDEBRANDT 


INTRODUCTION 


The problem of fitting mathematical curves to statistical data 
has commanded the attention of statisticians and mathematicians 
for many years. The curves referred to the most by English- 
speaking biometricians and mathematicians are perhaps those de- 
veloped by Pearson from 1895-1916. He showed that a series 
of curves could be obtained by assigning various values to the 
parameters in a certain first order differential equation. A few 
years later, Charlier*, attacking the same question from a differ- 


*A dissertation submitted in partial fulfillment of the requirements for the 
Degree of Doctor of Philosophy in the University of Michigan—August, 
1931. 
1Karl Pearson, “Mathematical Contributions to the Theory of Evolution,” 
Philosophical Transactions, A, Vol. 186 (1895), pp. 343-414; also “Sup- 
plement to a Memoir on Skew Variation,” Phil. Trans., Vol. 197 (1901), 
pp. 443-456; also “Second Supplement to a Memoir on Skew Variation,” 
Phil. Trans., A, Vol. 216 (1916), pp. 429-457. 
2C. V. L. Charlier, “Ueber das Fehlergesetz,” Arkiv for Matematik, As- 
tronomi och Fysik, Vol. 2, No. 8 (1905), pp. 1-9; also Weber die Dar- 
stellung willkuerlicher Funktionen,” Arkiv for Matematik, Astronomi och 

Fysik, Vol. 2, No. 20 (1905), pp. 1-35. 
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ent angle, showed that any function could probably be approx- 
imated by using a certain function and its derivatives in the terms 
of the series: 


Fix=A,flxd+A, fR+A, i" (x) eee 


where the A; are constants. 

Charlier found that the constants A, could be formally de- 
termined, the nm th constant A, being dependent on the moments 
of F(x) of order not greater than 77. He illustrated the method 
of procedure for the case where y= f (x) was the equation of 
the normal curve of error, i. e. one of the Pearson curves. In 
fact, the successive derivatives of this particular function gave 
rise to a well known system of polynomials, namely the Hermite 
polynomials, and the coefficients are dependent upon these poly- 
notnials also. 

In recent years, Romanovsky' has succeeded in obtaining 
similar results for the case in which some of the other of the 
Pearson curves are used as the f (x) in the Gram-Charlier 
series. The successive derivatives of these other special Pearson 
type curve functions also result in systems of polynomials which 
bear fundamental relations to each other. 

It is the object of this investigation to show: 

(1) That the constants obtained by Charlier for his Type 
A series can be much more readily obtained by making use of 
certain existing biorthogonality conditions ; 

(2) That if the Type A series be generalized to the form: 


4 2 
Fix) =C, Q(x) £2) +C, Fe Ux) ty (x) +, Le Ox) f, (xe 


1V. Romanovsky, “Generalization of some types of the frequency curves 
of Professor Pearson,” Biometrika, Vol. 16 (1924), pp. 106-117; also 
“Sur quelques classes nouvelles de Polynomes orthogonaux,” Comptes 
Rendus de L’Academie des Sciences, Vol. 188 (1929), pp. 1023-1025. 
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where f,, (x) is a polynomial of degree n in x, then the C., | 
can also be formally determined and depend upon the moments 
of F'( x) of order at most 77 ; 

(3) That the form of the polynomials obtained by Charlier 
and Romanovsky for certain solutions of the Pearson differential 
equation can be found for any solution of this equation and that 
the. relations existing between polynomials of the same system 
can also be generalized for the general solution and for the most 
part obtained without having the explicit form of the solution; 

(4) That results analogous to those obtained in (1) and 
(3) can be derived for the Charlier Type B series and the analogue 


of Pearson’s differential equation, finite differences replacing the 
derivative. 


The writer wishes to particularly express his appreciation to 
Prof. H. C. Carver for the valuable aid he has given both in the 
stimulating instruction characterized by’ frankness in indicating 


unsolved problems in his classes and through direct suggestions 
in the preparation of this paper. 
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CHAPTER I 


PoLyYNOMIALS CONNECTED WITH THE GRAM-CHARLIER SERIES 


1. In the articles entitled “Ueber das Fehlergesetz” and 
“Ueber die Darstellung willkiirlicher Funktionen”! Charlier 
proves the following well known theorem: 

CHARLIER’S THEOREM FOR SERIES OF TYPE A—If Fi( x) és 
any real valued function of x, which has finite moments of all 
orders, then F’' (x) may be formally expressed in terms of an- 
other function f (x) and its derivatives as follows: 


(A) F(z) =A, £G)+A, F0)+A, f'Gd+ +A, FOO) ++ 


where f (x) has the following properties: 





(a) 7 (2x) and its derivatives are continuous for all real 
values of x, 


(b) Ff (x) and its derivatives vanish for 2x-=+#+00and -0o 
(c) Jim, x¢% x) 0 for all m andr, 
+00 
(4) ff f(x)ax#0. 
= 00 


The conditions (c) and (d) are not given in Charlier’s ar- 
ticles, hut an eaxmination of the proof shows that he assumes 
implicitly that they are ‘satisfied. (x)= ry satisfies 
(a) and (b) without satisfying (c) and (d). 

In the first section of the latter paper, Charlier determir 2s 
the constants A,,A,,A,, . . .A,, . . . He takes t'.e 
series (A), multiplies it successively by 1, 2, x% . . ., and 
integrates each result between the limits - c to+0®. The fol- 


1C. V. L. Charlier, loc. cit. 
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lowing equations result: 


[Ftwax-A, [Fea 


[vx F(xjdx A,[ x taddx +A, [x Flx)dx 


ns a _— 4 = ” 
I wt Fladx =A, J UX adxra, if 2*¢ AddxtA, [x f (xdx 


Each of these equations contain a finite number of terms and 
the constants A,, A,, A 2’ * + + May readily be determined by 
solving them. In fact we find that any constant A, may be ex- 
pressed as 


+00 
A,-[ Pr@F@az 


where P,,( x) is a polynomial in x of degree not greater than 
m. An analysis of the underlying facts reveals that what Char- 
lier has actually done is to show that under the conditions listed 
in the theorem there exists a uniquely determined set of poly- 
omits PL (a), Ale) .- « Blzh.. « Bx) 
at most of degree 7, biorthogonal to the set of derivatives or 
functions of f (x), i. e. satisfy the biorthogonality conditions: 


[OR Cot adx = 0 for mn 
= 1 for ™m=N 


Further a study of the coefficients of these polynomials shows 
that 


d F, (x) 
ci *-P i, 


i. e. we have the following theorem : 
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THEOREM: If f(x) satisfy the conditions (a), (b), (c), 
and (d) of. Charlier’s theorem for series (A) and if R(x), 
P,(x), ... A(x) .. . ts the system of polynomials 
in x, F(x) of degree at most 7 , which is biorthogonal to 
f (x) and its derivatives, i. e. satisfies the conditions 


fm () F"(x)dx = 0 for mén 


= 1 for m-n 


si ad --P_, (2) 


7 


This can readily be shown to be true directly from a use of 
the biorthogonal property. For integrating by parts we obtain: 


[Ret edz « Biot OU) | - [p! Ga) f'n) dex . 


The first half of the right hand side of this equation 


vanishes due to condition (c) of Charlier’s theorem for series 
(A). For the second half we have 


[Plat (x)dx = 0 for mén 


= 1] for m=n 


But we know that 


+ [Pag FON) dx = 0 for men 
1 for m=77 


determines uniquely the polynomials F,,., (x). It follows that 


aR (x) /dx --B,@) 
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A corollary to this last theorem may be stated as follows: 
CoROLLARY : 


If LPB (x) f™ x) dx = 0 for mén 


for m=n 


AP (x)/dx=- G2 Pry 


The proof is similar to the one just given. Integration by 
parts gives the following result: 


- f FO YX) P’ (x)dx=0 for m#n 


-a, for m-=77 


But we know that 
[10° %) Py dx= 0 for mén 


=an4 for m=-n 


Therefore we may conclude that 


dP. (x) 


ax Ft @) 


7-1 


ARG  _— @, 


P 


x) 
ax 2n-4 7-1 (- 


An illustration of this corollary is the case of the well known 
Hermite polynomials which are involved in Charlier’s first paper.? 
These satisfy the conditions ‘ 

--4) 
ao 


1C. V. L. Charlier, loc. cit. Charlier uses as f (x)° G/F7 © 
this paper we shall use the simpler basic function e-**. 


In 
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#00 -x2 
TH in (2) Hy (22) dx = 0 for m#¢n 


= 2” n!/r for m=n 


H,, Ge -, -”" d7 le ax 4 
Hence 
I Hn (x)d “Ve ~/dz ” dx = 0 for m¥én 
=(-2)"n!lVrr for m=n 


xt 


If then f (x)=e7 
applies, i. e. we have 


AH, (x/dx = 2nH,,, (x) 


and @,=(-2)"n/Vr our corollary 


We might further observe that if a, = (-1)" n/ then the 


polynomials 7 (x) form a system of Appell polynomials? satis- 
fying the relation 


AP (x)/dx-nP,, 


the 77 th polynomial being the coefficient of A”/7/ in the 
expansion of a (4) e”* where 


h* h” 
apy # + * Op et ss 


ath) =, +2 “, + 


The fact that differentiation of the 7 th pol nomial results | 
in the negative of the (77-1)th polynomial,shows that the 7 -h 
polynomial may be obtained by integrating the ( 77-1) th cue, 


1M. P. Appell, “Sur une classe de Polynomes,” Annales Scientifiques de 
L’Ecole Normale Superioure, Vol. IX, series 2 (1880), pp. 119-120. 
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which will consequently determine all of the terms of the n th 
polynomial except the constant. This constant may be found 
from any of the conditions of biorthogonality. The simplest of 
these conditions is: 


Pn f (x) dx =O 


Setting 







P,@)=-["R-1 (x) dx +c 
a 











Sf Re (x) dx +c] f(x) dx =O 
as SOUL P, dx lf (0 dx 
Fim F(x)dx 


and so 






x 
so that P(x)=--/ P,dx+ 






a i P_, &) dx J faax 
f “Fidx 


This gives a very simple and elegant method of writing down 
successively the polynomials associated with any function f (2) 
satisfying the conditions of the theorem. 

Using the Charlier notation 


So x" F(xax 
An ° 7 








and observing that P,(x)=1/A,, we obtain the following 
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polynomials : 


[. a “piwra| F(ajdx 


Pa) =- Pidx+ so 
‘ L i F(~dx 

s.% 

A, Az 


° 


+eof .x 
i. i. B(a)ax\ fix)dx 
J tiadz 


x 
7 P(x) dx + 
° 


x® Ax AZ A, 


a xr 2 


Qo 


> 


I. [ JOR Cxrdz| f (x)ax 


=- v "RB (dx + 
- F(z)dx 


+ 
~oo 


x? Ax? Ax Ax A 2A,A, Ay 
oA ee Rk’ RR. OU: x 


2. Just as the Hermite polynomials, based as they are on 
the derivatives of e - , are the starting point for expansions 
of the Gram-Charlier type and for the theorem just considered, 
so the Laguerre polynomials defined by d”(a+bx)"e “/ax ” 
suggest an expansion of the type 


2 
F(x) =C be) p(x) + C, & fed p(x) + C, Ex fy (x) Pla 


where fe (x) is a polynomial in x. As a matter of fact we 
can state the following theorem: 
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THEOREM: If @ (xX) is a function such that 


(1) Q(x) and all its derivatives are continuous for all 
real values of X , 


(2) Q(x) and its derivatives are zero at x = +00 and - co, 


(3) lim, x” 9"( x) =0, 


(4) {f,, (x))} is a sequence of polynomials in x such 
that [-" f, (x) P(x) #0, 


then there exists a unique sequence of polynomials Fig (at db. 


PB, (x) at most of degree m, such that 


+ oo 


L ROEAG (Wo dx = 0 ae 


= 1 for m=n 
If f, (x) ts at most of degree 7 , then the determination of 
i. (x) depends at most upon the moments of Q@ of order 77 
The method of proof is modelled on Charlier’s proof for the 
preceding case. By substituting in the 7 th integration by parts 
formula 
ful)" @dx = ay. ote (n-d 
tu"y™™.. o(-1)"u My 
6-1) fa Or (x) v(xJdx , 
we have 


1The Laguerre polynomials are not a special case of this because there the 
interval of integration is -a/b toto. 
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a aP. a? 
[- Rw — io at fF &) P(x)~ al rl (2) 


ae 


t-te)” ae alan 





norte P, (2) FO (x) dx 


+0| 7” 
=(-"/ Ss ee 2) £0) Plu) dx 





because of conditions (2) and (3) on @ (x). Asa consequence, 
if 7 > m then 427 P, (x)= 0, so that for 7 >-777 











+oo a” 
J we ®D) atx” bn (2) Plx)Ax =O 


77 
that is to say A, (x) is orthogonal to fon f,, (x)9(~) 
provided 7 > m Hence F (2x) must satisfy only the follow- 
ing 77+1 equations: 


77 


i Fon 2) f, @) P(x) ax-O 


+00 d +00 d P (%) 
a — f@PCadx = eof FS tT, QP @H)dx = 








+00 da? - amin 
[ROR FC) PD (x)dz =(-1) i ave EOP (x) dx-O 









a 
[30S nt, topldar= -D”| — p 


ao 






Replacing now 7, (x) by @,4+4,%+4,x*+.--+a,x% 
gives us the system of algebraic equations to be satisfied by 
Qo,@,.+* 52 +2 + a , Viz? 
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a, [ t”Pwdx+a, [ Xf () Plddx 
ta, [ 2*h ix) pidder-----4a, ["X" h(x) P(x)de= 0 
Q, [ ‘4 (2) Dc)dz+ 2a, J: xf (x) Ple)dzr- “+712, [ 2 () Dx)ddc-O 
‘2a afte (x) P(x)du+---+7(nA1)a ze xf Lid) P(x) dx = O 


; ‘i i 
(7-2)/a, we (x) wd oterder lly 1) Zh 0-2 OP(9dx 


vee. x2 Fg) P(%) dx-0 


nif +00 
(7-1)!a,, 1 CD O(x) dx +73 GJ * Ff, (I P(e) dx<O 
ee 27-2 


7 +60 
he nla, Lt, (JD (x)dx =! 
We have here a unique determination of a,, if the determinant 
of the coefficients is ¢ 0. This is true since the determinant 
A=(-1)"(/RoIVJt,g).... 7 £9) is#0 because of the 
condition (4) on @. If Ff, (x) is at most of degree 7, it is 
obvious that the determination of the FP, (x) resulting from 
the coefficients @, depends at most upon the moments of @ 
of order 7. 
The first three polynomials of the type considered in the 
last theorem have the following form, the limits of integration 
being - co and + 2 in each case. 


P (x)= Sz P(x) dx x 
St WE) Draddx TPixrdx St adO0ddx 


tS cera -| 
S£@ew@dx | Jonmdx ; 


Sut, COD (x)dx [x Dax [x? Dladz 


BO Te Coe laax ft () place] Oixjdx 21h, Neda pile 
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_ _xSx fy (4) 9 (x)aAx 7 a 
St A) Diurax [fk (xc) P(x) ax 2']t A)Dlxi)dx 


Sx hindl(idzJet, WPx)dxfeplxdx 


FE GHD cqnendcainnmanemmasnecianasameniaminesumanenmmemmans 
9S Cargptands [tz Idx ff (yb ade [Vlad 


_ Sf 2?h 20 dx feGa)dz 


2l[t, JU xdx]t J gploax]/plrax 


Sep Cblalde fa? bixiade 
2!/f, (Pld ax Jf, CI Pfdx]0(x)dx 


x3D (x)ax z [eh iJ) O(adx [et (af le) ax 
"3h h,@P(wdax [Predx St f, (2) UfaJdz Tt; G wW)MJdz [f, EL (DPlx)ae 


: 2 [x*4, (df (t)dz ‘ 2* (rh (wD (adx 


2! [tyagddx fi agiajde 2, Wpidar tw piddr 


z3 
Sf fy) D(2)dx 
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CHAPTER II 


POLYNOMIALS CONNECTED WITH PEARSON’S DIFFERENTIAL 
EQUATION 


1. In the work in mathematical statistics a large number 
of the problems that require study involve data properly classified 
into groups and about which further information is sought. This 
data is often classified to form a frequency distribution. The 
frequency distribution when grouped may appear to lie on a cer- 
tain curve. If it can be shown that this curve is a mathematical 
curve, i. e. one for which we are able to set up an equation, then 
this frequency distribution can be readily examined and studied. 

There are very few frequency distributions which actually 
conform to known mathematical equations. However, there are 
certain curves which seem to lend themselves much better to 
statistical manipulations than others. Among the most commonly 
used of these are the so-called Pearson type curves. Pearson! 
showed in a series of three articles how. he obtained the equations 
of twelve distinct curves and this was done by considering the 
differential equation 


1d. @,+a,x 
y Ax Dt 6x4 Bx? 


and solving it, after assigning particular values to the parameters 
2,,4,4, 5, ,and 5, . The equations of these curves and 
the differential equations from which they were derived are as 
follows: 


1Karl Pearson, loc. cit. 
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The curves most widely used are the normal curve of error, which 
Pearson calls Type VII, and the Type III curve. 

Suppose a Pearson curve f (x) has been found which 
seems to fit a given distribution fairly well. The question may 
well be asked: Is it possible by means of analytic methods to 
approach even nearer to the given distribution? For example, 


would it be possible to use this approximate function 
as the # (x ) in the Charlier series (A) and thus obtain a closer 
approximation to the observed frequency function. 

Charlier in his paper “Ueber die Darstellung willktirliches 
Functionen’? considered this question for 9 (x ene ; 
i. e. the normal curve of error. He showed that using this 
g (x) reduced the series (A) to the form: 


(A’) F(x) =2,9(”) +a, gx) +agP lx) + 4a, 9 %Ur)e--- 


the first and second derivative terms vanishing due to the proper 
choice of constants. This series (A’) is frequently referred to 
as the Gram-Charlier Type A series. It is worthwhile to note 
that this Y (zx) is the same one whose derivatives we found in 
the first chapter resulted in the Hermite polynomials. These poly- 
nomials have the following interesting properties?: 


(1) dH, (x) fax =< 20H, , (x) 
(2) Aygyy (2-2 Hy (2)+ 20H y.4 (2)=O 


(3) Ae (a) -2xH, (2) + 20H, (x) =O 


The first of these relations shows that the derivative of any Her- 


mite polynomial corresponds to the preceding polynomial multi- 


1C. V. L. Charlier, loc. cit. 
2R. Courant and D. Hilbert, Methoden der Mathematischen Physik, 1, 


pp. 76. 
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plied by 27. The second equation is a recurrence relation between 
the (77+ 1) th, 7th and ( 77-1) th polynomials, while the third 
relation is a differential equation of the second order involving 
only the v7 th polynomial. 

The use of the equations of the other Pearson type curves 
as the f (z) in the original Charlier series has in recent years 
been studied by Romanovsky. In the first' of two articles, he 
discusses the Pearson Type I, II and III curves as well as the 
Type VII—the normal curve referred to in the last paragraph. 
Just as the normal curve of error requires the use of the Hermite 
polynomials, he found that the Type I curve and Type II, which 
is a special case of Type I, involved the Jacobi polynomials 


zi %y- 2) 2" a” gt7-1 ort), 


G,(h,¢,2)= g(r) ---(gt7-1 “az” \|* & 


The 7 ’th Jacobi polynomial satisfies the second order differential 


equation”. 


(1-2) G(x) + [¢-(p+l)x] G, (x) (ptm) nG, (x)=O 


which corresponds to property (3) mentioned for the Hermite 
polynomials above. The Type III curve involves the Laguerre 
polynomials’ defined by 


” 
Lnlxe e *Saw(z"e-*) 


and these in turn satisfy the recurrence relation 


1V. Romanovsky: “Generalization of some types of the frequency curves 
of Professor Pearson.” op. at pp. 106-117. 

2R. Courant and D. Hilbert, op. cit., Vol. I, p. 75. 

3R. Courant and D. Hilbert, op. cit., pp. 77-78. 
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L net (2) -(2 4+1-x)L, (2)+ ha (x)<O 
and the differential equation 
Li @)-aly.,@= -nl,, 


In the second article?, Romanovsky reviews the cases of 


the Type 1V, V and VI curves. The generalization of the Type 
IV curve gives the polynomial 


7 , i 
P (7m, x)=(a% x*) e Ve | ota waer “ 


where Q=arc tan %/qg. These polynomials possess properties 
similar to the other polynomials mentioned, viz. : 


P41 (n+l, x)= [20 741-m)z- Va i (2,2) 


+27 [r+1- m2] @*4z9F_, (7x) 
and 


(a*+z*) P (nz) +[ 2(1-m) zx - Va] 


P'(n,x)-n(nel-2m)P, (m, x) =O 


Similarly for the Type V curve he finds the polynomials 
v 
h & - “At2y7  ~&% ) 
Pi thz)=x e° Tonle € 


Also the relations 


2V. Romanovsky, “Sur quelques Classes nouvells de Polynomes othogonaux,” 
loc. cit. 
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P.,, (nel)x) [ Cne2-p)x+Y|Pinz)s n(én+2-p)z*P,(n,2) 


and 


x* PX(nx)+ [2(2-p)+V] By (m,x)-n(nel-p) B, (7, x) +0 
hold. 
Finally for the Type VI curve Romanovsky gets the polynomials: 


77 
P,, (-h.q x)= (x-@) 7" 2» [cea ~? “7 


and the relations: 


Fs (m+1,x)= [(Kp+lNz-a)+(qet) x| Pin x)+x(2z-a)P, (n, x), 


2(x-a)P'(n, x) (- pel Nx-a)+ (qe )x| P,, (n,x)-nlneleg-pJP.(n,)20 


We note, therefore, that if a solution of the Pearson dif- 
ferential equation is used as the generating function f (x) in 
the Gram-Charlier series, that a distinct set of polynomials re- 
sults in each case and that these polynomials satisfy certain re- 
currence relations and differential equations. These properties 
are not found in the case of functions such as sech x and 
sech x , which were discussed as generating functions by Char- 
lier? and by Roa? respectively. The successive derivatives of the 
iC. V. L. Charlier, “Ueber die Darstellung willkirlicher Funktionen,” loc. 
cit., pp. 18-22. 
2Emeterio Roa, “A Number of new generating Functions with Applica- 
tions to Statistics,” Doctor’s Thesis, University of Michigan, 1923. 
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sech x do not result in polynomials such as the Hermite or Jacobi 
ones. 

Since the generalization of the solutions of the Pearson 
curves leads to distinct sets of polynomials and since these poly- 
nomials satisfy certain fundamental relations, we are led to inquire 
whether these polynomials are not special cases of a general poly- 
nomial and may be obtained from it by specializing the coefficients 
and further whether such general polynomials, if they do exist, 
will satisfy certain recurrence relations and differential equations. 
These problems are among those which we shall consider in this 
chapter. 

2. In order that we may develop the generalized polynomials, 
let us consider the Pearson differential equation where the numer- 
ator is of the first and the denominator of the second degree, i. e. 


Lady. araz 
yY a& b,+b,x+b,x 
For convenience we shall denote the numerator by /V and the 
denominator by D. We then have the following theorem: 
THEOREM: If y is a non-identically zero solution of 


dy_ N 
(1) “i. DY» 


F D” a” A . 
then > aren is a polynomial of degree at most 77. 


The proof will proceed by mathematical induction. It is 
obvious that the theorem holds for 7=1, P (x) being NV. 
Since it is true that 


d 
O-ae" Ny 


we obtain by differentiation 


d*y 1 dy dy ‘ 
On * 2 s& NS ONY, 
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or using (1) and multiplying the equation through by D we get 
2 aZ*, 2 / ‘ 
D? D4 = (N*- ND'+ N'D)y 


Since D’ is linear and NV’ is a constant, it is obvious that 
(N*-ND + N'D) is at most of degree 2. 

Assume then that the statement holds for 777 < 77 and we 
have 


2 0" 2, = Pay. 
Differentiation gives 

nD” D'S n + D” EE ms Pl Get Lf y. 
Multiplying through by D we get 

D™ dows “DP, 2) 2L-np"D' £3, +D sig y, 


and using (1) and (2), we have 


| ‘ dP (x) 
Py - NF tay - nD F, Wy +D—2—y 
= Wig) - DR 0) - 0 y. 


The coefficient of y is obviously a polynomial of degree at most 
77+1. Incidentally we have derived the relation: 


(1) Pret(@)=Pr (2)(N- nD')+D ws (x) 


an equation which gives the ( 77+1) th polynomial in terms of 

the 77 th polynomial and its first derivative -. (2%). 

3. More generally we have: 

THEOREM: If y is a non-identically zero solution of (1), 
/ n-k a” 
y - dz 


then k 
7A Dy 
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is a polynomial B,( kK,x), re (K,2) ts at most of degree 
7 in X. In particular if k=7, we have that 


{77 
= ax D”y 


is a polynomial in x of degree at most 77. 

This theorem can be proved directly following the lines of 
the preceding theorem, but it is simpler to obtain it as an imme- 
diate consequence of this theorem and the following lemma: 

LemMaA: If y satisfy the differential equation (1) then 
D*“y , where k is any real number, satisfies a differential equa- 
tion of the same type, viz.: 


d N+kD' 
az (D *y) - “—— b*y 
Let we D‘y 
Then logarithmic differentiation gives at once 


1 du yp D', 1 ay | N+kD 
uaz D y dad D 

It follows from this lemma that any result which we derive 

concerning the polynomials P (x )at D” 2, where y sat- 
7 7” azx 

isies D dy/dx=Ny _ ,is immediately extensible to the poly- 
nomials F(k,x)= $ p7* aor Dy by replacing /Y by 
N+kD’. In particular relation (I) becomes 


7 P. (kel 
(Ig) Boag (k#1,x)=[N4 (k= nol) D'] DB, (ket, x) 2D 20 z) 


which for ‘= 77 reduces to 


(L,) Paeg (rl, 2)=(N*D) P,(nel,x)+D Palreh ed | 
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We single out the case K=77 because of the fact that this 
case parallels most closely the Charlier or Hermite polynomial 


case. For in this latter case the 7’th derivative of the generat- 


«a. ‘ : 
ing function e ~ is the product of the generating function and 


a polynomial of degree 77. So in the case of any solution y of 
a Pearson differential equation, the 77 th derivative of D”y 


is the product of the generating function y and a polynomial of 
degree at most 77. 


By means of relation (I), we can write down the successive 
polynomials A(x), A(x), . . . The first five polynomials 
may be written as follows: 

P, (z)=N, 
BD (z)=(N-D')P, (2)+D LE). N*ND'tN'D 
2 (2) =(N-D!) P, (2)+D LEE ~ w*% ; 


P(x) =(N-2D)) B(x)+D: LO 
=N*3N2D'+ 3NN'D+2ND'*-2N'D'D-NDD", 
Py (2) =(N-3D') P, (x) +D _ 
= N*6N°D+6N7N'D +11N*D" -/4.NN'DD'-4N7DD" 
= -6ND'+6NDD' +ONDD'D'*3N' D-3N'D *D” 


G 
Ps (x) = (N-4D) PB (2) +D “5 az x 


‘ < ’ 
= N*-10N*D's10N ND+35N D -50ON'N DD 
, Fa 2 
IONDD'-50N*D'+70NN DD -40N*DD'D" 
1% 2 ta 8 “ i* ’ 4 3 
+15NN D -25NN'D*D 424ND' -24N'DD 


” 2 
-36NDD' D" -20N'D'D +24N'D'D'D'*6ND*D" 
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4. Following the analogy with Hermite polynomials, we ob- 
tain next a recurrence relation involving the ( 77+1) th, 77’th 
and ( 77-/ ) th polynomials. 

Starting with the original differential equation 


dy 


Daz 





=Ny 


we take the 77 th derivative of both sides, which by Leibnitz’ s 


theorem on the derivative of a product gives us, since a7 =(0, 
da mel, ‘a* -. m(n- A) 5 va” % a’ %, Qt 
ss? +2 +7N eat, 
- dx 7 2! dx™ Nay ax” 


Multiplying this last expression by D ” and collecting terms, we 
get: 


741 ‘ rla-Ys, ‘ aly 
pty 4nd!) ty aD" 28 Mp" nw | OY, 


da” 
Replacing now D” rid by %, (x) y and dividing through 
by y , we get the recurrence relation 





(II) Pays C(nDN)P, (x) n| 22 D*w'|D P P.,@-0 


We note that the coefficients of P,,,, (2) and F(x) are 
the same as in relation (I) which we found to be 


aP (x 
Pryyy 0) + B, (x) (mD'-N) =D —3— - 


Hence 


d P. (x) , (7-1) 
am BR on |W YD) 2 cw 
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or replacing 77 by 77+1 we write: 


a Pa) 


—-* (m#1)N'- 2D P,, (2) =(m+lla,-7 bz) PC). 


This equation is the generalized form of the one for Hermite 
polynomials, viz. : 


On = 2n Hy, , (2) 
5. Relations (1) and (III) may now be used to obtain a 
second order differential equation. Differentiating (1), we get: 
Prins OO) 4(nD"N' P, (Ce) Hn D*N) PB! (x) 
-D'P} (x)-DP,"(x)=0. 


Substitution of the value d@F,,, (x)/dx from (III) gives 
us: 


DPitxd+[N- (1-1) DP (x) 
(IV) 


vine w'- mo? Pp, (x) =O 


We readily see that the relation found for the Hermite poly- 
nomials 


H% (x) -2xH, (x)+ 2nH, (x) =O 


is a special case of (IV). 

Using the lemma previously proved and replacing N by 
N+kD' we can write (IV) for the polynomials r. (k,x) 
and PB ( 7, x ): 


DP,"(k, x)+ [N-(n-k-1)D"| PB! (k, x) 
CV) (n-2k-1) 
-n |v (n-Bk op" P(4,x)-0, 
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DR"(nx)+ (N+D') P(x) 


(IVp)* 


- n| W's (rel) "|B (n,x)=0 


We recognize the second order differential equations mentioned 
earlier in this chapter for the polynomials of the Pearson Type 





1Since D is any expression of the second degree and NW is any expression 
of the first degree, it is obvious that <“" (x) satisfies a linear equation 
of the second order of the form: 


/ 
(A, +A, +A,x*)y +(B,+B, x)y'#Cy=0 


where C =-7 [in- A, + B, | . It may be shown that if a differ- 
ential equation of the form considered has as one solution a polynomial of 
degree 7 then C must be of the form specified. For suppose @,,( x ) 


satisfies the above differential equation for y. Taking the 7’th derivative 
of this equation we get 


mat) ‘2 A,(n!a,)+ 1B (n!/a,) +C(n!/a,)<O 





and solving for C that: 


C=-7n [(7-A, + B, | : 
It follows from our work that if a differential equation has the form 
(A, tA,x+ A,x*)y"+ (B, +B, x)y' 
-* [(n-)A, +B, | y=0O 


then one solution of this differential equation is a polynomial of degree at 


most 77 obtained by finding the solution y of the Pearson differential 
equation 


dy_B +B, x -(A,+2A,~) 
ax A,t A, x+A,x 
and determining the polynomial 


PB, (n,xd=$ a, {fA. +A,x +A,x*|” y} ; 
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IV, V and VI as well as the Jacobi and Laguerre polynomials as 
special cases of formula (IVy). Some further illustrations of 
(IV,,) are the Tschebycheff* and Legendre? polynomials. The 
Tschebycheff polynomials are developed from the differential 
equation 


dy x 
adzx* Txt Y 


and in this case formula (IV,,) becomes: 
(1-x9)B"(nx)-xPB, (nx)+n®P, (nx)<O 
The Legendre polynomials 
P(x%)= , a“ 


ee axn7 


have as a corresponding differential equation 


dy O-y 
dz" x*-1 


and in turn formula (IV}) is written: 


(x2 1)P" (nx)+2xP, (nx) - n(nsl) PB (2,x)=O 


6. Just as in formula (II) we established a recurrence re- 
lation for the polynomials = (x), let us now obtain one for 
the polynomials ‘ ( 7,x). 

Consider once more the first derivative of D “ yi ¢ 


2 p'y)a(Ken DD sD’ 


=[N+(K+1)D'|Dy 


1R. Courant and D. Hilbert, op. cit., pp. 73-74. 
2Ibid, pp. 66-69. 
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Taking the 77 th derivative of both sides of the equation we get: 


oy (ao k+ly)« [etre DL, ig 
tn|[N (cond at Dy . 


Multiplying or sides of the equation by D”* and replacing 
p** 42.5 ky by P, (4x )y, we have 
ax” 
Past (K +l, x)=[N+(K+1)D'|P (kK, x) 
(Ve) , 
+n| N'+&+1)D"|D-P_, (K, x). 


In case we set A= 77, we may write 


P41 (ntl,x)= [V+ (n+1)DP, (7x) 
t n[N'+(7+1)D'\D-P_, (nx), 


(Vn) 


a recurrence relation similar to (II) and involving the poly- 
nomials P,,, (7+ 1,2), Pi(mx)and P,)(7,x). 

7. Formula (V7) may be written in still another form cor- 
responding to formula (1), i. e. a relation consisting of the same 
terms as (V;,) except that the ( 77-1) th polynomial 7,_,( 7, x) 
is replaced by the first derivative of the 77 th polynomial 
(m2). 

In order to obtain this relation we return to formula (III), 


dP nh = n| W’- Dp Pos (x) 


and substitute for NV the value NW + kD’ and obtain 


a F,(7,x) ‘ £) 
—_ a. n|'* Cree. p’| Py (nx) 
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Sia! 


P11,” = WW are 


Substituting the value for P__, ( 7,x) we thus obtain: 


P , (a+1,x)- [N+(m+D'P (77,2). 


N'‘+(741)D" D. Fi (7,x) 
N'+( "a" a” ax 


From symmetry we might expect the fractional coefficient of the 
derivative P( 77,z ) to be unity, but unfortunately this is not 
the case. 

8. In looking over the relations existing for the Laguerre 
polynomials we find one consisting of the first derivatives of the 
77 th and (77- 1) th polynomials, and the ( 77-1) th poly- 
nomial,? i. e. 


gil (77,x) - 7P., (1x)=-0 Fh, (wt, x) 


This relation is a special case of another form of formula (VI) 
which we obtain in the following manner: 
Differentiation of (VI) gives us: 


dP. 


aP/ 
Ane CHh*) «TN '¢(nt)D™|P, (nx) # W4(nt1)D) == 


N'+(741)D" ‘dB inx), N+(nDd a*P (nx) 
ye taglps 2 Se? Wetaglp" PB 


Substituting the value for @2*F,( 7 x ) /dzx* found in (V) 
changes this last expression to the form: 


1R. Courant and D. Hilbert, op. cit., pp. 77-79. 
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Best t x) CAD) Iy'o(n01)0"| B (nde |Nnt1)D | “2 
x 


N+(7#+1)D" pi Zh alm, 7%) Wl 
t N'+ 27D N"+ (231)D 


‘ ak ( A ) 1 . 
E wep) 22.0? 4m (2H We nx 
which reduces to 


P. (amet i 
ae . =(7+1) [w’ +(n+1)D"| P 7, (71,%) 


(VII) . 

+ fvetnes) D1 -tagh 5 N} = . 
The special equation mentioned for the Laguerre polynomials 
will be recognized as a special case of formula (VII) if we recall 
that for the Laguerre polynomials the differential equation is of 
the form 


ay | P-x 
dz x y 


Substitution of x for D and ( p-x ) for N reduces (VII) 
to 


PL, lx) =- (ntl) PR (nxJ)+ (nel) P, (nx). 


9. In this chapter we ae defined two general types of 


‘ D” d"y 
polynomials P,, (z)=— : an 


Dp7-* 
and PB, (k,x%)= 25 ton D*y 


The relationships for these polynomials (x) and F,(k,x) 
were derived without using the form of the solution of the dif- 
ferential equation. Two fundamental formulas were derived, 


for P(x): 


(I) Py (x)* (N= nD')P, (x) + D 2 
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and for P ( 7,x) the corresponding formula: 


/ N+(n41)D™ P 
(VI) Pj (m+1,x)= [We(nes)D’| inh tae D2 
“— ” 


Two successive polynomials were shown to be related by the re- 
lations, for P(x): 


AF (x) | ’ (77-1) ‘| 
ian ao ceeds Pp 


and for P( 77, x): 

(III,,) ffnm*) = nN’ is (ret) p"| p P,,(nz) 

In addition we found that it was possible to set up recurrence 
relations involving the ( 77#1) th, 77th and ( 7-1) th poly- 
nomials and found these to be, for /,( 2): 


(Il) Pag (x) #0 nDNDP, (en| 27D") D- B, yl)-O 


and for P_( 7, x): 


(V,,) BL, (+t x)= [W + (WDB (mx)4n|N' +(n+I)D"\ DP nx) 


We further succeeded in developing a second order differential 
equation for the 77’th polynomial 4" [a2 


(IV) DP, +[W-(n-)D'] B09) - n[w'- S50] P, (x)= 0 
and for r4 mx): 


(IV) DP,(n,x)4(NtD IP, (7,2)-rIN'+ CH 0" P (n,x) «0 
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We also showed that we could derive a relation between the 
derivatives of the polynomials P,, ( 7+1,%), By (72x ) 
and the polynomial “A (71m,x): 


(vit) 2 Frcs (6h*) <n s1)(N'+(ns)D]R (mx) + 


| _ N‘+(741)D” dP l , x) 
{lws(r41)D" We (Bel) pT Ha 


Finally, we noted that all of these formulas and relations apply 
to the Hermite, Jacobi, Tschebycheff and Legendre polynomials 
as well as the polynomials derived for the Pearson Type IV, V 
and VI curves by Romanovsky. 
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CHAPTER III 


1. So far the discussion in this paper has been limited to 
the treatment of the Gram-Charlier series where the constants 


Ay AyAz» » + »A,, -» » . depend upon polynomials in 
x which are independent of the function F(x), and the gen- 
erating function f (x) is a solution of the Pearson differential 
equation, the functions /’( x) and f (x) being defined as con- 
tinuous functions. The work in mathematical statistics involves 
not only the use of the continuous variate and the continuous 
function but also the case of the discrete variate and the discon- 
tinuous function where this function is defined for equally spaced 
values. 

In dealing with the continuous variate we make use of the 
theory of the differential and integral calculus, or the calculus of 
limits, as it is sometimes called. On the other hand, for the dis- 
crete variate we turn to the theory of the calculus of finite differ- 
ences. Further, it usually happens that there exists a parallelism 
between results based on the derivative and integral and those 
based on the finite differences and summations. As a consequence, 
it seems natural to attempt to derive results for the finite differ- 
ence case paralleling those contained in the first half of this paper. 
The second part of this paper is devoted to this purpose. The 
first of the two following chapters considers matters pertaining 
to Charlier’s Type B series which is the finite difference parallel 
to the Type A series, while the next chapter is devoted to the 
polynomials connected with the finite difference parallel of the 
Pearson differential equation. 

Charlier in the second half of his article’ “Ueber die Dar- 
*C. V. L. Charlier, op. cit., pp. 23-35. 
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stellung willkurlicher Funktionen” considers a real valued func- 
tion F’( 2) and asserts that it may be formally expanded in 
terms of another function and its successive differences. Stated 
as a theorem, this may be written as follows: 

CHARLIER’S THEOREM FOR SERIES B: Any real valued func- 
tion F’( x) which vanishes for x = 00 and - o°, may be form- 
ally expanded in terms of another function g(x) and its suc- 
cessive differences in the form 


(B) F(x) -Bgl)+B Ag) + B,A"g(x)+ + B, A” g(x)+ 


where G(x) possesses the properties: 


(a) G(x) and its differences are defined for all real values 
ofx , 


(b) g(x )and its differences vanish for x = +o and - co 


3 


(c) x77A ” 9 (x) "0 for all real values of mandn. 
(d) Av g(x) |t3# 0. 


Paralleling the theory of the first half of his paper, Charlier 
determines the constants B B, ‘ B, Shs gece 


* . and 


finds that they may be expressed by the equation 


B,, = 2, Q(x) F(x) «AQ, (x) Fx) \*% 


where Q 7( x) is a polynomial in x of degree not greater than 
7. Analyzing the answers that he obtains for a ,( x), we find 
that these polynomials form a uniquely determined set of poly- 
nomials @,(x), Q, (2x), -»Qlz), - + 4 Q, ee 6k 
a,( x) at most of degree 77, biorthogonal in the sum sense to 
the successive differences of the function g ( x), i. e. they sat- 
isfy the biorthogonality conditions for the inverse of differences: 
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-1 m +o =O for nym 
A Q,0CI4B GO|. 1 for n=m. 


Charlier does sot observe that the polynomials @_( x) bear a 
definite relation to one another, i. e. 


AQ,,00 = -Q,_, (+1), 


a relation similar to the one found for the polynomials Pp ( x) 
in Chapter I. We may state these facts in the following theorem: 

THEOREM: If g(x) satisfy the conditions (a), (b), (c), 
and (d) of Charlier’s Theorem for series B and if Q,(x), 
Qi(%), . . 4 Q(x), . . . ts the system of polynomials 
in x, Q,( x) of degree at most 77, which ts biorthogonal to 
* (x )and its differences, i. e. satisfies the conditions 


oo = 0 for »#¢m 


- 77 
A'@, (4 ov i, « tte em 


AD QW 00 *- Opi, COL). 
The proof requires the use of ‘the finite integration by parts 
formula: 


-4 -1 al 
a Uy vy = Uy 4 v, -4 


~J 
[ 4u,-4 Veet] 


Applying this formula we get 
A’Q,, (x) Gx) ne =Q,, (x): A" “G(x) |** 
-4"[a Q,,(2) a” G (xt1) We 


The first term on the right hand side vanishes due to condition 
(c) of the theorem of Charlier. Comparing the term which 





E. H. HILDEBRANDT 


remains, i. e. 
~f m-1 +” =0 
- [AQ,~oA gel) x“ 


with the biorthogonality condition 


-f m-L 400= () 
A 2,441) grt) 


we conclude that 


AQ, 0) =-Q,_1(*+L) 
This theorem enables us to find the terms of the 77 th poly- 
nomial by taking the negative of the integral of the ( 77-1) th 
polynomial, except for the constant of integration. Following the 


suggestion in our first chapter, we may also determine this con- 
stant. We have 


Onlx)=- AQ, (x11) |Z + 
and the simple biorthogonality condition 
A*Q, (x) g(x) a =O. 
It follows that 


4“-A"Q,., (xo14C HX) a 2 


and solving for C we get 


" AA QOp.4 Et) ]o 90) |"S 


G 
A9(x)|*S 


We may therefore determine the polynomials ?, (x) from the 
polynomials next preceding by the formula 


a [4°10,_, er Hx) |x. 


Q,,()=-A7Q,., a+) | + 
A-* g(x) |\t% 
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If we adopt the Charlier notation 


Cun =z) x"G(x) = At xg (x) we 


and the common notation x= x(x-1)(x-2)----- -(#-m+1) 
and observe that @, (x)= 1 Eo and that 
‘i (7m) _ x (m+1) 
Aux 7 Mm+lL 


we may obtain the polynomials Q,(x), Y,(), 
without much computation as follows: 


a [a-4 Q, (x+1)|" Hx) \*° 





x 
Q,0) = -44Q, (x+1) |, + : 2 
j A o(x)|*2 
gee ic 
= z + €2 
=f ai ¥ +0 
Q, (x) = -A4'Q, (c+) . pe 4 [a Q, (x+1)]* jx) oud 
° A-t9(x)|*® 
_ (zt _ 4 etl) , 2EF +66 -E,&5 
«BE, - -_ 


orl é,Q, (x)= 6x" -E, x(2E, -E,) +26,-E, 6-66 


* af a + 00 
@00--s7@ (ant + LS). 0 [., 


-1 
AM Gx)| > 
- cera)” | é, (x+2)™ af (2E7+E if, ~ ke &, Mx+2) 
LE, yr4 .— 2 ng 


a €5Er~ 3E2,€o -6 65 & 16,7 + 6G7E, +2E, rg 
L3€3* 


Or 136Q,(e) = ~ 62x? +3 62x77 (GE) 
- 6, *(2€3- 66,6 36,6, +662 4 6, + 3E 65 +26, 68 
~66,6,6,- 66°64, +66," 





| 
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These results differ slightly from those obtained by Charlier 


in his article. This is due to the definition for differences used 
by Charlier, viz. : 


A g(x) = g(x) - 9(x-1) 


while we have used the definition 
A Gx) = Gfxt1)-G(X) . 


Denoting the difference 


GX) -G(x-1) by 69(x) 


Charlier determines a set of polynomials 7, (x) satisfying the 
conditions , 


= 0 for myn 


67 (7, 00) 6 g(x) 


= ] for m= 7 


As a consequence by paralleling the reasoning above one proves 
easily that the ln (x ) satisfy the recurrence relation 


7,, (#+1)-T,, (ade -T7,,_, (&) . 
By using this relation and the fact that 
6 "9 (x+n) = a”"glx) 
it can be shown without much difficulty that 
T,, (x+ n-1) = Q,, (%) 


The theorem proved in Ch. 1, par. 2, could no doubt be 
paralleled by using finite difference theory. Since the method of 


procedure is obvious there seems to be no need of taking it up 
in detail. 
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We have succeeded in showing in this chapter that the prob- 
lem of determining the constants for the Charlier Type B series 
closely parallels the work of the first chapter and that these con- 
stants are readily obtained by using the biorthogonality conditions 
for finite differences. 
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CHAPTER IV 


PoLYNOMIALS CONNECTED WITH THE PEARSON DIFFERENCE 
EQUATION 


1. In Chapter II we referred to certain solutions f (x) 
of the Pearson. diiferential equation and noted that graphically, 
these functions represented types of curves used in statistical 
work. Paralleling this work, we would expect to find that a dif- 
ference equation similar in composition to the Pearson differential 
equation would have as solutions functions g ( x) which could 
be used to represent data consisting of discrete variates. 

Carver, in an article in the “Handbook of Mathematical 
Statistics,”* suggests the use of a difference equation correspond- 
ing to the Pearson differential equation, i. e.: 


Ga + GA, % 
Ay Bab x + Oyxte bute OF? 

a difference equation with a numerator of the first and denom- 
inator of any desired degree in x. If we confine our work to a 
denominator of degree at most of the second in x, we should be 
able to obtain results comparing very favorably with those ob- 
tained in the second chapter. 

An illustration of a solution of this differénce equation found 
in Charlier’s article “Ueber die Darstellung willkiirlicher Funk- 
tionen,”? is the well known Poisson exponential function 


Ys (x)= Man 


x 


1H. C. Carver, “Frequency Curves,” Handbook of Mathematical Statistics 
(H. L. Rietz, Editor), Chapter VII, pp. 111-114. 
2C. V. L. Charlier, op. cit. p. 33. 
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This function satisfies the difference equation 


hu, Se 0 


x+] 


and this equation is recognized as a special form of the Pearson 
difference equation. If we take the successive differences of this 
Poisson exponential function, we find that these give rise to a 
unique set of polynomials. These polynomials may be written in 
the following form: 


Q, (x)= A-(x+1), 
Qo (x) = A*- 2X (x+2)+(x42)(x41), 


or making use of the usua! difference notation for 
x, x (2%-1)(x-2)-----(2x-r7+1) , we write 


Q, (2) = A?- ZA (x+2)+ (x12) 


(2) 


Q, (x) = A°- BX* (x43) + A(x 3)? (x43) 


Q, (x-3) ao APs SA*x + 3Ax Oe ™ 


? 


Q,, (x)= A"- CA eon) Cyr” “Ceonyh...4(-1) "ery 


or Q, (x-7)=A™,, C,Aa” oe Gas 7... «62s - 


These polynomials have the same form as that for the bi- 
nomial expansion (A-x )” , particularly if we use the differ- 
ence notation for representing powers of x. In other words, we 
might look upon the 77 th polynomial as being defined as 
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Qn (x-n)= [A- x 7” 


A careful examination of these polynomials brings out the 
fact that consecutive ones are related to each other, viz., that we 
have, 


A Q,, (4) = -7Q,,-4 (#1) . 


This relation is similar to the one found for Hermite polynomials. 

The fact that the Charlier Type A. series in Chapter II 
consisted of successive derivatives and that the derivatives of the 
solutions of the Pearson differential equation led to a system of 
polynomials definitely related to one another, gave rise to the the- 
ory developed in that chapter. We found that it was not neces- 
sary in this theory to consider the form of the solution of the 
equation, but that a set of general polynomials could be set up 
which satisfied all the properties of the special polynomials. The 
Charlier Type B series consists of successive differences of a func- 


tion g (x) and it is quite natural for us to suspect that we can 
develop for the solutions of the Pearson difference equation a cor- 


responding theory on polynomials. 
This question of obtaining a system of polynomials from the 
solutions of the Pearson difference equation 


Q,+ a,x 


(1) Au 


x b+b,x+ b,x? _ 

numerator of the first degree and denominator of the second 
degree, will concern us in this chapter. We shall further show 
that these polynomials are related to one another by means of 
first and second order difference relations and by means of re- 
currence relations involving the ( 7741) th, 77 th and ( 77-1) th 


polynomials, and shall illustrate these equations with the Poisson 
exponential function. 
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2. For convenience denote the numerator (@, + a,x) in equa- 
tion (1) by /V, and the denominator ( 6, + b,x + 6,x*) 
by D, . We may then define a set of polynomials by the follow- 
ing theorem: 

THEOREM: If Uu, is a non-identically zero solution of 


> 4 


x 


a Q, 7-1 


then ‘ D,, D 


“ti 


Ais a polynomial of degree 


at most 77,1.e. Q,, (x). 


The proof will proceed by mathematical induction. If we 
recall the formula for the difference of a product 


A[u,vj=-v,4u,+u,,, Av, = Vv, du, +(u,+Au A tis 


we obtain by differencing 
DAuy ae N, uy, * Y, uy 


the equation 


D474, +Au, AD, -Q, (2)+ 49,0) Au,+ u,-4Q,@). 
Using the value for Au, from the original difference equation 


and multiplying the equation through by D, , we obtain: 


D,, Dy, O°u, =[Ny 9, 0+ DD Q, 4) + Nz AQ, &)4N, OD, |uty 


xl 
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Since the coefficient of uw, is a polynomial of degree at most 2 
in x, we write 


Dy Dy, 4c, Q(X) icy . 


Let us now assume that the statement holds for 777 < 7, i. e. 


77 
D, 2, Pgs Many’ = eeat 4 uy ™ Y, (Wily . 
Differencing both sides of this equation gives us 


DB Dio Dean 1A” ey OL, +AU NODB,,D,;: 


X+77-1 tl Ki2 een J 
= Y, (Ju, +(u,+4u, )4Q,, (x). 


Now 


AD, DB 


#1 en-1 


7 wee 


x41 “242 “2B Qi , 


+77 


*(D44° , ‘Dayz n-1 UD, xt” “2 , 


Hence by the definition of Q,, (x) 


A|D,D 


a4 


2+7-1 


-D 
Ai, “een On Od uy 


Substiiuting these values in the above equation as well as the 
value for Aw, from (1) and multiplying by D, , the equation 
reduces to 


DD, ,,Di,5° -Diyy Ae, -[\,@, (x)+D, AQ, (x) 


x X41 x42 xt 
+N,4Q, (x)- } am (x) 4D, Q, le)\uty 


The coefficient of uw, on the right hand side is a polynomial of 
degree at most 77 in x . We therefore conclude that 
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DD, Dyan Au, “Qos (Bey. 


We have also succeded in deriving a relation similar to relation 
(I) of Chapter II, i. e. 


Ppa 1) =(N, 4D, - Dy, Q(x) 
(XI) 


+(N,+D,J4Q,, (x), 


a relation which shows that the ( 77+ 1) th polynomial is made up 
of the 7th polynomial and the difference of the 77 th polynomial. 
This relation differs from relation (I) in the fact that the co- 
efficient of AQ, (x) is N,+D, instead of D, . This 
change seems to be connected with the fact that the original dif- 
ference equation 


D,4u,=N, uy, 


can also be written 


Formula (XI) may also be written 
(XI) Q,,, 0) =(N,+D,) OQ, (2+1)-Di py Py) 


sincee 8 @(x)+ Y,, (x)= Qn, Ce). 


It seems advisable to adopt a notation for the term 
Ll, P41 Dy : ** Day aes 
since it will continue to be involved in the work that is to follow. 
The difference notation 'x“”= x (x-1(x-2)--- -(x-m+l) 
suggests that we use the symbol RB 7 i.e. 
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(77) 
Di”"- DB 


x x 


—~“e-2 ‘Dy net: 


Then we will have 


(77) 
=D 


2¢+7-1 


DD D. zt+n-l1 


Xx x42 


AD” -D D 


x+17-1 21m “x4n-1° 


‘Mo 


X42 XH 


(n-t) 
: 2, n-1 My 7-2 Qu 2 =Qyin -2) Misa ° 


3. We may also define the general polynomials Q, (77 x) 
where 777 is any integer, by means of a theorem as follows: 
THEOREM’ If 44, is a non-identically zero solution of the 


difference equation (1), then 
(7) 
x-777+77-1 


D iond™ 


X-f x 


isa polynomial G (7x) = ,and Q(77, x) is at most 
of degree 77 in x . In particular if 77:77 , we have 


A” oe u, | 


x-1 


A 


x 


i [+ (77) a 


x-1 


is a polynomial in x of degree at most 77. 
This theorem may be proved by using the following lemma: 
If uy satisfy the difference equation (1), then 
, where 777 is any positive integer, satisfies a differ- 
ence equation of the same type, viz.: 
(7) . 
A[D?” a} = Dey uy, GA *D, D _— 
x-1 a 
nr 
X-777 
The proof proceeds easily by mathematical induction. 
For 777=1 we have 
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Dy. 4] = D,4u,+u, 4D, 


=N,u,+4D,_,u 


? 


@D_»4y = +2, atl 


x-l 


For 77 =2, we get 


u,]- 2,4 [D.., uy] +D,, 4,42 


x-2 


Dye Dau thy ke ee) 
D 


x-2 
Mf? 6 HO x. eee | 
Dy-2 
Let us assume that it holds for the 77th case, i. e. 


D, 


x-9n 


ALD} fm) a). Ru | 
x Sa aa 


Then 
A [Dy 1 Oe ey) = Dey MOL eh DO” u, ID 


x-™-1 


(mm) 
a uy W,+D,-2, a 


-77 x — 


x-1 


x-77-1 
Making use of this lemma in proving the last theorem, we 
note that 


Do” a an [M, + Dy - Dy. 7 
RQ -™7 met 


or Dn Bm 4 10” ost, [Mz + 2.2m] 
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and in general that 


an i, J Des al Q,, (™, x)| 


x-774+77-1 


or Fg A’ (DL7” u,] = as oe ~ Da Cm, x). 


-m7I 40-4 

In particular, if 777- 77, we define the polynomials Q_ (7, x) 

as A” [Dy ged url= 9, (7,x) uy which relation is of in- 
terest because the A” has no D,, as multiplier. Any result de- 
rived for the polynomials ?, (%) = 4 Fed t 4 uy where uw, 


is a solution of the difference equation (1) can now be extended 


to the polynomials G,, (777,x) = panne by replacing 
xu-f 
N, by (N, +D, ae and D, by a. . For ex- 


ample, relation (XI) becomes 
Vag 1 x) = (N+ Dy -De_ nj p41) Um (m1), ©) 
(XI,,) 
+(N_+D,J AQ, (™m+41, x) 


and when 777-77 , this relation reduces to 


Q,44 (741%) = (N, + D,-D,_ JG, (nA HN ADJIQ, (7141, x) 
(XI,) 


=(N, +4D y1)Pr, (st, x) +(N,+D,)IQ, (741, x). 
4. In analogy with the work of chapter II, we next proceed 


to find a recurrence relation involving the ( 7+ 1) th, 7 th and 


( 77- 1) th of the polynomials @ (x). We take the 77 th differ. 
ence ot both sides of the equation 


D, Au, = Mt, 
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by making use of the formula for the 7 th difference of a product 
a” [uy v= vy Uy +7Qv, a” at 


+ OY A A” Une Hew w a 


We then obtain the equation 


ved. 7 77(7-L) 2 7-4 
DA u,+nAD,4 Gye?” of 4 D4 “.0e* 


N,A”u, +7 AN, A” 


x+l? 


A°*D and A*N, being equal to zero. Multiplying through 
by Df” 
y we get 


x+77 


(™) ,7tl (77) ” 
Din 4) “e+ MD, ADA" Uy, ,, 


(77-1) (nm) ,2 7-f 
7 2! Di404 D, 4 Cyez2 


(7) 
= N,D, 


X¥+77 


” (7) re£ 
4u,+nD,,, IN,4 Cyel 


2 2 
But u,,,=u,+4u, and u,,2 = 4, +2Oy4, + A*u,. 


Substituting these values in the last equation and using the defiin- 
ition for the polynomials @,, (x) , we obtain: 


Ong Dit a [Deon Q,, CD) + Prigy (x) \ uy 


nin-1) D A*D @) 5a | 
1G Baud Be (De, ot 2D 0 Mal 


x41 


N, Dan AN, 
= D, Dyn Pq Oey + TILy % a Pn (x)| 4, 


x 


Dividing through by «c, and collecting like terms, this expression 
reduces to 
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| nAD,  nln-l) A*Dz = AD, WnDaRy 


x+77 
2 D Ben 


NzDysn_ 7Dyin NZ n(n-1) 
er 


x xz 


2/D, FD Ben-1 Pein 4 Dy 
“ Deans D. AN, 
a | Oye (220. 


Now we know that 


77( 7-1) 
Usin * hy +7Mu, + a Ly +---- 


and so we may write D,,, and W,,,, in this same torm, i. e. 


Dyn = Dg + MAD, + "OT? AD, 


AD 


x+77 


= AD, +nA"D,, 


and Nyan = Net 7ANx 
the third and higher differences of Dy and the second and higher 
differences of MV, being equal to zero. The coefficient of 
On41 (0 reduces to {—2#7 and the coefficient of 
Q,, (x) also reduces to a simpler" form. Dividing through by 


2 Dron we finally get the recurrence relation: 


x 


Qraq Ct OD, 1 > Nean? O, 62 
(XII) 


+7D. 


x+r7-1 


es 1) 


A*D,-AN,| Q,.1@) +O 


i. e. the ( 7741) th polynomial may be obtained from the 77 th and 
( 7-1) th polynomials. 

In Chapter II we found that relations (1) and (II) were 
identical for the first two terms, and as a consequence we equated 
the third terms and obtained a relation between the derivative of 
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a polynomial 2 (x ) and the polynomial preceding it. In order 
that we may obtain a similar expression for the difference poly- 
nomials, we must change the appearance of formula (XII). 

By lowering the degree in formula (XI ) from 7 to 7-1 
and solving for D,, , O,., () we find that 


Doan. Gy.4 CD= (Ny +D,) Qn-1 (x+1)-Q, (x). 


Substitution of this relation in formula XII gives 


Qe C= Neg 9 MAD yy 9-4), 


+ ra N,-CE#a*7 2 |, +D,)Qp.1 (#49-O,, (a 


- Ones (%)= or -7 AN, - - 7 AD, n3* a 4°22, (0 


” n[AN,- co “7D, N,+D.)O,.1 (z+1). 
Just as in Chapter IV, paragraph 3, the coefficient of © 7 (%) Te 


duces and becomes the same as the coefficient of @, (x) in 
formula (XI) and we have 


Q,,,4 0) =(N,+D,-D,,,) 0, 


2477 


+ 7fON,- 32 *2\(™. +D.)Q., Gel) 
We therefore conclude that 
(XIII) AQ, (x)< n[AN, - S¥a2D, ]Q,., +n, 


a relation expressing the difference of a polynomial @,, (x) in 
terms of the next preceding polynomial in (x+#1), i. e 
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Qn, (%+1). For the polynomial Q_ (7, x), formula (XII) 
may be written in the form 


(XIN) 1Q, (nx)- [ON + C2" 2°Q1Q,, (n <+2), 


this relation being obtained by replacing NV, by (Ny, 2,-2.4) 
and D, by D, 

Formula XI{1 which was just derived is the general form of 
the relation we found to hold for the Poisson exponential function 
polynomials, i. e. 


AQ, 14) =- 7Q,,1 (#1). 


We find further that these polynomials satisfy a special form of 
(XI), i. e. 


O41 A) #24 n+l-A)Q,, (0) -AQQ,, (x) =O 


and for formula (XII) we get the special form 


O49) + Cer 2rrtl-A)Q,, Qe) + (247) @,,_,(%)-O 


This recurrence relation is also similar to the one given for La- 
guerre polynomials. 

5. Turning now to the problem of obtaining a second order 
difference relation for the polynomials Q,, (2) , we proceed to 
difference formula (X1), i. e. 


Q,,410)=(NztD, ~D,, IQ, 2 + (NM, +D,JA®,, Ce) 
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and get 
AQ, 44 02) = CAN, +4D, -AD,, 7) QD, () 


+N, 41 + Days "Memes )4 Y, (a 


+(AN, aeiggiie (0) + (Nyy + Doug) 4° QO, (2). 
Substituting for JG_,, @ the value 


(nel) [4% -Z a“ [2 (=) +4Q, (2) 
found in formula (XIII), gives us 
(m+) [AN, -7a*D| (9, (y+ @,, (x) " 
[An, +4D, AD, en] (x)+ [Wy xe * Bes” a AY, (x) 


+ (AN, 4D] 40, Ce) + (Nig + Deu) 4°O, 0). 


Collecting the coefficients of like terms and simplifying them, we 
finally get 


CN, “+4 + DJA" Gn (x)+[N, x-7704 (n-1) AD, | AQ, &) 


(XIV) 


(7-D) 


-nfAn, - $F” a°D,) Q,, (x) +0, 


a relation very similar in form to formula (IV) and consisting 
of the first and second differences of the polynomial Q),, (x) 
This relation when applied to the Poisson exponential function 
gives 


AL“Q, (x) + CA-%-1)AQ, (x) + 1Q,q (4) =O, 


an equation which can be checked by substituting the value of 
the general Poisson polynomial in it. 

The extension of formula (XIV) to the polynomials @,,(77,x) 
and Q,(7,x) by making the proper substitutions for /V, 
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and D, results in the following expressions : 


(N, xe1* Deh Q,,(mx) 
(XIV,,, ) + +[M,. 7+L * Dames _) -n NAD, ny| 49, (mx) 


ON, +AD, AD, -(7F) AD, ny \Qy (2) O, 
which may also be written as: 


(N, ‘v+1* Dp4)4°Pn (7,2) 


[Nemes +(m-nat AD, - =F? g?p| AQ, (mx) 
-n|AM, - nem-t np l a, (m7, x)=O 
In particular if 77-77 we have: 


(N.447 AE, GQ, (nx) + 
(XIV, ) +[ M44 +4Q- 7S a*D] 49, (n») 


- n[AN, + cng aa | Q,, (1, x)= 0. 
6. The next set of relations we shall derive are recurrence 
relations for the polynomials G,,/77,x) and @,, (7 x) 
In the lemma proved in this chapter we found that 


(Do *"4,) [2.4 ‘~ 4, |[% +D, -Dy. gh 
Taking the 77 th difference of both sides of the equation gives: 


BD 1.) +N, DD, mA" DLF” 24] 


<I ¢779) 
+n(AN,-AD, AD, py, A” 1,1) 


the second difference of the trinomial (/V, + D,-D, ie J being 


equal to zero. Multiplying this last expression through by 
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Ft . “é and substituting for A, = Pd Am Dimes 


the value D7" @ |, (m,x)u,, we get 


(met) lrael 
Dy Oras (m+l x)u, = (N, +D, Dy .m.1I Des "Cn (m xu, 


dN, +D, 
+7(4N,+AD,-4B. JQ” AF) Ont (om, x+l) uy. 


Dividing through by ee" ™ we get a recurrence relation 


involving the polynomials @,,,(7+l)x), Q,, (77,x) 
and Q,_, (7m, x+1) , i.e 


Pret (rel x)=(M,+D, Dy. 7-1) Py, (7,x) 
(XV77 ) 

+ nl AN, tm) LD, | (N,+D,) Q-4 (rq x+/). 
For 777= 7, this expression reduces to: 

Onset (m+1.x)=(N,¢D,-Dzn_1) Om (7x) 
(XVp ) 

* n[ AN, + (nel) A° D,,| (N,,+DJQ,.1 (mn, x#1). 


7. Another form of this relation is obtained by substitut- 
ing the value found in (XIII,,) for @,_,(m,x#1) , i. €. 


fl 


1 GO 
Q,,.4 (7, ¥4L) nl AN, +737 A*D,\ 


AQ,, (7, x), 
in formula (XV», ), which gives 

Qyrgy (744, 4) = (Nog t Dy Day. yg) Oy (722) 
(XVI) 


AN, +(n+1)A*Dz 
a a | 
+(N,+D,) AN, + (r+1 A*D, Q, (nx) 
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a relation very similar to formula (VI). 
8. There remains one more formula in Chapter II for which 
we have not yet found a parallel in this chapter, i. e. formula VII. 


To obtain this parallel expression, we difference formula (XVI), 
thereby obtaining : 


ADnt (M41, x)=(AN,+4D,-AD,_,,_,) %, (7x) 
+N. 44 + Dias - D,.,) 49, (7%) 


AN, +741) A “dD. ; 
AN, * PLD, [ an, +AD,JAQ,, (77, %)+(Ne 44+ Dy 4p 4Q,(n4}, 
In formula (XIV,,) we found a value for 


(Nyo1 + Dy) 4° Q, (7x) 


which when substituted in this last expression gives us: 


AQ, (744, 2) (ON,+4D_- AD, eas AHN yyy * Diy 2D. 40, (nx) 


AN: +(n+#1) A*D mined, 2 
‘AN, CDAD, [AN t AD, + BAD, - Nenu AD4O {00 


AN, +(m#L)A* D, (mgt) 
+7 TN + OD AD + EP 4°D, [4n, + 


AD, 0, (nx). 


Collecting coefficients we get 
AQ a (741, x)-[4 N,+nAN,+AD, -AD,.,,.4+ n(net)A*D) Y, (nx) 
[Myo + Dass -Dn|On (7x) 


AN, +(n+LA* Dy. 


AN, DAD, lan,- N.nes* nin t) 4D] 20, (> 
+ 
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and by simplifying the coefficients this expression finally reduces 
to the formula 


AQ pug 7H x)= (ne1AN, +(n41)A ‘D, | @, (n, x) 


a | t(D, - - FP", 


(XVII) 
- AN, +(7+)A°D, a (nel) n2r, 
ewe se |Mwr 7S 2.) 4, ny 


a relation which is also similar in form to formula VII. 

Before concluding this chapter, we might examine the char- 
acter of the polynomials @, (77 x) when the original func- 
a 
tion is the Poisson exponential function Y/%)= — 
We find these — to have the following form: 


ze" A 


Fo =Q (,x)=A-X, 


@) 
rasta Q, (2,x)= N2Ax+ex@ 


7X A 4x 
*Q; (3 x)= D?- 3A*x 4 3Ax@ 


reg. (n2=N nA” 


ard) 7-2 ~ nin) 
a2! 


A x -t¢ Dx, 


:[A-2”|' Q,, (x-n). 


Substituting the proper values for N, and D, in formula 
(XIV,,) we get 


AL*Q, (nx) HA-x+n-L)AQ, (nx)+ 7 Q, (7,x)= O 
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In the same way we find for formula (XI,,) the relation 


Qnu1 (741, x)= (A-x+4 71)Q,,(7,x)+ AAQ, (7, x 


and for formula (XVII), the reduced relation 


40,4 (74h 2) =- Cr+) Q, (7), 





which is somewhat like the relation obtained for (XIII). 
We might call attention to the fact that these polynomials are 
identical with the polynomials obtained by Charlier’ satisfying the 
relations 






-A yx, +0 = 0 for m7 
5” lz, we” —— 


x! 4.4 «<1 for m-<7n 















9. Summarizing the results of this chapter, we have found 
that if the general solution G(x) of the difference equation 


a. + &,& 
‘Oo 2 iL 
b, + 0,x + b, x* 


Au, = - 4 





is used as the generating function g (x ) in the Charlier Type B 
series, that the successive differences give rise to two general 
types of polynomials which we defined as follows: 











x% 


1 (7m), 7421 





f 


Uy, 


y 77 7 07) ins 


x-77 a 


QO, (1, x) = 


With the aid of the properties of the Z| operator, we derived 
a set of relations and equations for these polynomials of the fol- 
lowing form: 


tC. V. L. Charlier: “Ueber die Darstellung willkiirlicher Funktionen,” p. 34. 


SYSTEMS OF POLYNOMIALS 


(XI) Qivs=(N+D,- Dn dQ, (x) +(N,+D)A Y (x) 


(XIn) Or 2=(N,+AD JO, (41 x)+(N,+BIAQ, (72) 


Qo) (2) = (Ny, 7 7 7AL, 7-1? Gp (2) 
XII 
( ) (7-1) 02 
e nL, nL 4N,- —z 34 2,| Pn (2) 
xa! Qu) =(N, ‘D.-2,, 7) Wp (x) 
( ) (7-1) 2 
+ nan, - SP 02D) (MD_DO,, (241) 


(XIII) AQ,(@)- nlANn,- 07D) Q,, ery) 


(XII,) AQ, (x, x)= n[IN, + 73? A*D] O(n x40), 


(Nut D4 4)4 “Q, (z)+ [A,. 7+] -(n-OD| AY, (x) 


(XIV) 
- nlan,- 02D] Q, (<0 


(XIVp) Mis +D,,,)1°@, (nedo[N, syaqt Dy: mer AD (nx) 
. n 


nln, + 4 2*0,]@, (n2)20 


Giat (r+L x) =(N_+D, ‘Mngt % (7.x) 
(XV,,) 


+ n[ AN, ’ (r+) 472| (N,+D)Q,.4 (nx+1) 
1 (41,2) = (Nt Dn )Q, a2) 


AN, +(n#1)4*D, ) 
NB) Ans TAD, 


(xXv1) Wns 
AQ, (7,2), 
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(XVII) AQ, +lx)=(7+1) [AN, +(n+1)A*D,| Cine 
+ N44 + C7#L)AD, - 72(m2l) 727 


AN, +(r#DA*D, ees 
aa N, + Cpl) A*D Mn ’ a°D] f49, (32). 


Each of these formulas corresponds and is similar to a for- 
mula found in Chapter II. In fact, it seems probable that if we 
developed the formulas in this present chapter from the equation 


4a, . &% 


x 


= al 
Ax D, - 


and permitted the 4, to approach zero as a limit, the formulas 
of Chapter II would result, the above formulas being the case 
where Z| ~~ 











A NEW FORMULA FOR PREDICTING THE 
SHRINKAGE OF THE COEFFICIENT 
OF MULTIPLE CORRELATION 


By: 


Dr. R. J. WHERRY 


Cumberland University, Lebanon, Tennessee 


With the perfection of the Doolittle Method for the solution 
of the constant values necessary for the multiple correlation and 
prediction technique, we may expect a constant increase in the 
use of this method in statistical practice. Theoretical statisticians 
have recognized for some time however that the multiple correla- 
tion coefficient, derived from a large number of independent vari- 
ables, is apt to be deceptively large due to chance factors. When 
prediction equations derived in this manner are applied to sub- 
sequent sets of data, there is apt to be a rather large shrinkage 
in the resulting correlation coefficient obtained, as compared with 
the original observed multiple correlation coefficient. In order 
to avoid overoptimism it is necessary to have some equation 
which will predict the most probable value of this shrinkage. The 
development of such a formula is the purpose of this paper. 

The most promising formula of this type so far developed 
is the B. B. Smith formula, presented by M. J. B. Ezekial at the 
December, 1928, meeting of the American Mathematical Society 
held at Chicago. This formula is 


(1) 
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where R= the estimated correlation obtaining in the universe 
R = the observed multiple correlation coefficient 
Mz= the number of independent variables 
N= the number of observations (the statistical popula- 
tion). 

This formula was evidently developed by B. B. Smith by an 
application of the method of least squares as follows (the deriva- 
tion is that of the author, since he could not find it given else- 
where) : 

The customary formula for the coefficient of multiple corre- 
lation may be written in the form 

2 


(2) p*= 1-22 


4) 


2 
where 


(3) 


where 
(4) 


The method of least squares, however, says that the most 
probable value of the standard error of estimate is not that given 
in equation (3) but 


2 


(5)! 


a 
N-M ° 


Now, if we substitute the value of (5) in place of (3) in 
equation (2), we have at once 


1See Merriman, Method of Least Squares, John Wiley & Sons, London, 
8th Edition, pp. 80-82. Also see derivation later in this paper. 
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— 2 N 
(6) | a 


a* N-M 

and since , by (2) above, we have s? equal to (1 - R . ), 
Go" 

we have 0 


2 2 
7) p*. 9. N(1-P ) iy. 1-R 
N-M f- M 
N 
which is, exactly, the B. B. Smith formula (1). 

This formula has been widely used during the last few years, 
but up until recently had not been subjected to much critical ex- 
amination. However, in a recent article in the Journal of Educa- 
tional Psychology * S. C. Larson actually tested the formula em- 
pirically on some data obtained from the Mississippi Survey con- 
ducted by M. V. O’Shea, obtaining the results indicated in the 
tables and graphs below, and on the basis of which he reached 
the following conclusion: 

“The Smith Shrinkage-Reduction formula parallels all of 
the empirical findings but quite consistently gives values which 
are in excess of those obtained under present experimental con- 
ditions.” This meant that the Smith formula predicted shrinkages 
consistently greater than those actually obtained. 

It was in view of this reported empirical difference that the 
writer started his attempt to derive the Smith formula and hit on 
the method given above. The question at once arose in the writ- 
er’s mind as to why, when the standard error of estimate had been 
corrected to correspond to the most probable value by a least 
squares criterion, the standard deviation of the dependent variaile 
had not been treated in the same fashion. 


“The Shrinkage of the Coefficient of Multiple Correlation,” Jan., 1931, 
pp. 45-55. 
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Merriman, whose formula we used above in correcting the 
standard error of estimate (5), likewise, and by identical reasoning, 
shows that the most probable value of the standard deviation of the 
dependent variable existing in the universe, should really be rep- 


resented by the following relationships: 
Where 


Zz, z 
(8) . Sage 


we find 


(9) 


(10a) 


and when the same substitution is made as in step (7) above, we 
have 


=2 (N-1)R*(M-1) 
= N-M 


(10b) 


which is, by a more correctly applied criterion of least squares, 
the formula we have been seeking, and is a closer approximation 
than that given by the Smith formula. 

The reasons for the substitutions made above in our formulae 
may not be entirely clear to all readers, so we now present the der- 
ivations of the formulae given in (5) and (9) above. The deriva- 
tions given here are directly adapted from those of Merriman 
referred to above, but have been translated into the customary 
statistical notation whenever possible. 


First, let us consider the derivation of the value in (9). As 
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stated in (8) the most customary form of Sigma is 


a fu? 






(8) 


where 


(11) 










Each value x, has a certain error, however, due to the 
fact that the value of the mean is merely the most probable value, 
not the true value. So for each x, value there is a small un- 
known error 6x, , so that if we take x > to be the true value 


of a deviation we have 









(12) X= %,+ Sx, 


(4) 





and, squaring and summating, disregarding the terms involving 
second power delta terms as small in comparison with the first 
power terms, we have 













(13) i» wis x re 22, x, Sx, 





Now, by the laws of probability, we know that the probabil- 
ity of the occurrence of an error %, , whose measure of pre- 
cision is “h,” is 


z*h* 





(14) Nehdzn te! 





multiplying both sides of this equation by Z* and summating 
between the limits plus and minus infinity, we have 













ee 
(15) Zz ‘of a). ae. 
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: ait. 2x . . 

and since ZJIX is the same as ~jj— , since in our work we 
assume the weight of each value X , for each of the VV observa- 
tions, to be 4 , we have 


_2 
(16) az .i. 


N 2h* 


or 


N 


a2 
(16a) 2: = 372 


Likewise, if we let 
(17) EL X, bx, =-u* 
the probability of the system of errors, cz“, is 
2 


2 
(18) Wh «sataw te *" 


and the mean of all of the possible values of u* is 


+00 
h oe 1 
(19) we [ u%e Au = x52 


and this must be taken as the best attainable value of ry But it 


a2 
was shown that the quantity any is equal to ae (16). 
Hence 


—@& 
(20) Exe Lxte SE 
from which 


(9) 5" 
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which was to be proved. 

To derive (5) we proceed in much the same manner. After 
our normal equations have been solved for the most probable 
valuesof ,,, 6,2,fo3,°° * * Aor, for our set of data, 
we know that these are not the true values, but that they err by 
small unknown corrections 06,,, 562,545, ---: - - Bom» 
the corresponding true values for the universe being (4, *5@,), 
(Se . SS, 2 ), (As +055 ), ee (4ea* rn 

Now, if we substitute the most probable values of the Betas 
in our original observation equations, they will not reduce to zero, 
bus will leave small residuals yY ,%,.. . Vy » thus 


%o,~ %0y"Ko1 * 14 +2 “.,* “~—""*” +Lam i_,- * 


x.-%, = e+ 
oO, % OL “I> 


Lon Xo y= Fos 1° ny X03 Sy I ig My OF, 


™m™ T7n Cn N 


while if .the corresponding true values be substituted, we obtain 


(Bos +66,,)%,, HB. _ Ke)*2, etre *(Bo mn + Pom) * mo, ¥ 


(64+ 6By,) *1, (Poet : G2) %,, +++ fg + Bm D% 


(4, +§6,,)%,,, * (Bet hq) 
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Subtracting each of the former equations from the latter, we obtain 


v, + 08s: *1, +6, or ~2,* aati +S, mn 7% 


2? fu %,* Pig % + sone + fm, = %e 


Vay * Df o1%1,,+ Pz * ee + Yom *m," = 


Now the principle of least squares provides that 2 Vv = shall 
he made a minimum to give the most probable values of 6, 
Kye , » » »- fh, , and by the solution of the normal 
equations by the Doolittle method its minimum value is found to 
be 2 v* . From the residual equations we may find the re- 
lationship existing between the values X 7* and Z v2. Thus, 
if we square each equation immediately above and then summate 
we have (if we neglect squares and products of the delta values 
as small in comparison with the first powers) : 


(21) Lv*+266, 2x,v+266, Lx, v+---+256,, Ex veZV* 


which we may write as 


(22) avs uzeuse ke eee oe +u,,*=Z0* 


Now, by analogous reasoning to that in steps (14), (15), and 
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(16), we may set 









(23) Lvz- A> 








Further, if there be but one independent variable, there will 
be but one 256.5 x,v and its value by the same process 
used in steps (18) and (19) can be shown to be 


(24) 
and since that is true whichever unknown quantity be considered, 


the values of each uw a value must be xe ; and as there 
are © of these values the above equation (22) becomes 












2" _WN 
av *2Zh2 2he 
from which 


(25) ia fe e.. 
ZEv*® 


Therefore, from the constant relationship which exists be- 
tween the value “A ” and the Probable Error, we have 


av 
N-M 









(26) PE, -O0.6745 








and therefore, by the relationship existing between the probable 

error and the standard deviation we have at once 
_ 2 

(5) ox _Z2yv 


v ~%0 = WeM 








which was to be proved. 

The next step was to test out the formula empirically. 
This was done by using Larson’s material, with the results in- 
dicated in the tables below, and in the graphs which show the same 
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set of facts, but which make the results much more apparent. 

An inspection of the tables and graphs will show at once 
that the new formula predicts what will actually happen much 
more accurately than the Smith formula did. In graph 1, for 
example, the agreement is so good that the results appear almost 
to have been a regression line fit to the particular set of data. 

It was to have been expected that if the formula actually 
predicted the most probable values of the correlations obtaining 
in the universe that the errors incurred by the use of the formula 
would be normally distributed around zero as a mean value. Graph 
3 presents a comparison of the error curves obtained by use of 
the Smith and the Wherry prediction formulae, together with an 
approximation to the normal curve. As a further and more sci- 
entific check the criteria for a normal curve as set forth by Rietz’ 
were applied to the data. His criteria are 


7 
x 
where u,,= ax 


The results for the two formulae are given below. 


(Results based on an expectancy of zero) 


Smith Formula Wherry Formula 


'Rietz, H. L. Mathematical Statistics, Carus Mathematical Monograph 
No. 3, Mathematical Association of America, Chicago 1927, pp. 58-59. 
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It is apparent therefore that the Wherry formula gave much 
better results for both the first criterion (mean error) and the 
second criterion (skewness), but that the excess was greater for 
the Wherry formula than for the Smith formula. However, one 
cannot quarrel too much with getting errors actually smaller than 
would be expected by assuming normality. Even this superiority 
is seen to be fictitious if the distributions are measured from 
their own means rather than from an expected mean of zero. 
When this is done, which is the manner in which the criteria are 
customarily used, we have 


(Results based on means of distributions) 


Smith Formula Wherry Formula 





Thus,we find that the Smith distribution has, in reality, even 
a greater excess than does the Wherry formula, but has it at a 
point farther removed from the desired value. 


SUMMARY AND CONCLUSIONS 


1. Larson has shown that the theoretically expected shrink- 
age is an empirical fact. 

2. Larson has shown that the Smith formula, when tested 
empirically, consistently over-estimates this shrinkage as deter- 
mined empirically. 

3. It has been demonstrated that the new Wherry formula, 
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both by a least squares criterion and by actual application, is more 
nearly true than the corresponding Smith formula. 
4. The correct formula for the shrunken coefficient of mul- 


tiple correlation is 
_» (N-1)R* (™M-2) 
R ee 


N-M 
where R = the estimated correlation obtaining in the universe 
R= the observed coefficient of multiple correlation 
M-= the number of independent variables 
and = We= _ the number of observations (statistical population). 


Wy 
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GRAPH 1. 


Shrinkage as Obtained by Use of the Formulae and Also as 
Obtained Experimentally 


(Data from Table I) 
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GRAPH 2 


Shrinkage as Obtained by Use of the Formulae and Also as 
Obtained Experimentally 


(Data from Table IT) 
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GRAPH 3 


Ogive Showing the Distribution of Error in Predicting Shrinkage 
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TABLE Iil 


Showing the Mean Error Attained by the Use of the Smith and 


Wherry Shrinkage Formulae. 


Formula Table I Table IT Tables I and II 


Smith . 





THE USE OF THE RELATIVE RESIDUAL IN 
THE APPLICATION OF THE METHOD 
OF LEAST SQUARES | 


By 


Wa ter A: HENpRICKS 
Junior Biologist, Bureau of Animal Industry, 
U. S. Department of Agriculture. — 


The method of least squares offers a precise method of fitting 
a curve describing the relation between two or more related, meas- 
urable variables, but certain criteria must be fulfilled to justify its 
application. First, the type of equation selected for fitting must 
be the true mathematical expression of the law governing the rela- 
tionship of the variables. Secondly, all errors of measuremeat, 
made in obtaining the observed values of the variables when the 
data were collected, must be distributed according to the well- 
known laws of probability.’ 

This paper is concerned with the latter of these two criteria. 
The fundamental theory upon which the method of least squares 
is based can be found in any text-book on the subject and need 
not be elaborated upon here. However, it may be well to point 
out a very pertinent, if somewhat elementary, aspect of the theory 
which facilitates the ready visualization of the fundamental cc a- 
cepts involved. 


1Steinmetz, C. P. Engineering Mathematics. McGraw-Hill Book Co., 
New York (1917). 
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The application of the method of least squares to curve fit- 
ting, as ordinarily described in works on the subject, is perfectly 
analogous to the calculation of the arithmetic mean of a number 
of measurements made upon a single, coristant quantity. This 
may be easily demonstrated as follows: 

Let Y= £(X) describe the relation existing between an 
independent variable, X , and a dependent variable, Y. If it is 
desired to find the most probable value of the dependent variable 
when X has some definite value, X,, the most direct method of 
procedure would be to make a number of measurements of Y 
at this value of X and calculate their arithmetic mean, provided, 
of course, that the errors of measurement were distributed accord- 
ing to the laws of probability in a normal frequency distribution. 
According to the elementary theory of statistics, the most prob- 
able value of the dependent variable, f(X,) , would be such 
that the sum of the squares of the deviations of the actual meas- 
urements from this value would be a minimum. 

If X is conceived to be varying in value so rapidly that it 
is impossible to make more than one measurement of Y at any 
value of X , this direct method can not be employed. However, 
the most probable value of #(X 1) can still be determined. 
Let Y,, Y2, Y3, . - - Y> each represent a measured value 
of Y at values X,, Xz, X35, . . . Xv, respectively, of 
the independent variable. Since the errors of measurement are 
assumed to be distributed according to the law of chance, an 
error of a given magnitude is equally likely to occur at any value 
of X . In other words, exactly the same errors would be made 
in obtaining one measurement of each of the quantities Y,, Y2, 
Y,, . . . Y,asif £(X;) were measured 77 times. These 
errors may, therefore, be considered as having been made in meas- 
uring a single, constant quantity. Therefore, if #/(X) denotes 
the most probable value of Y at any value of X and Y de- 
notes the corresponding observed value, the most probable values 
of the dependent variable which can be calculated from any set of 
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data are such that the sum of the squares of the differences, 
f(X)-Y _, is a minimum. 

It is important to bear in mind that this conception of the 
distribution of errors of measurement is justified only when an 
error of a given magnitude is equally likely to occur at any value 
of X . In actual practice it often happens that this ideal condition 
is not realized. The magnitude of the errors of measurement is 
often influenced by the magnitude of the quantity which is being 
measured. In obtaining the live weights of animals at different 
ages, for example, it is common practice to use a less delicate 
balance in making the weighings as the animals become larger, and 
the magnitude of the errors of measurement increases as the 
sensitivity of the balance decreases. Other factors which tend to 
increase the magnitude of the errors may also be in operation. 
The error, or rather the unreliability, of the weight of a 1,000 
pound steer would be greater than that of a 100 pound calf, even 
though an equally sensitive balance were used in making both 
weighings, because of a greater content of material in the digestive 
tract and excretory organs and the increased effect of the move- 
ments of the animal. 

It is highly probable that in many fields of investigation such 
disturbing influences are encountered more frequently than the 
ideal conditions which justify the application of the method of 
least squares as ordinarily described. 

Pearl and Reed recognized the need for modifying the ap- 
plication of the method of least squares to compensate for changes 


in the probability of the occurrence of an error of any given mag- 
nitude and suggested, as stated by Pearl,! that it would be more 
ei . : F(X) - 
logical in many instances tu employ residuals of the type —— 
The use oi such residuals was based on the assumption that if 


the errors of measurement were expressed as percentages of the 


1Pearl, Raymond. Studies in Human Biology. Williams & Wilkins, Bal- 
timore (1924). 





W. A. HENDRICKS 461 


magnitude of the quantities measured, the percentage errors would 
be distributed at random according to the law of probability. In 
many practical problems this assumption appears to be justifiable. 

The study herein reported was made to determine the extent 
of the error made when the method of least squares as ordinarily 
described is applied to data in which the percentage, rather than 
the absolute errors of measurement are distributed according to 
the law of chance. 

The writer desired a hypothetical set of errors of measure- 
ment which, when expressed as percentages of the quantities meas- 
ured, would come as near as possible to forming a normal fre- 
quency distribution. 


TABLE I 


Ideal Frequency Distribution of 41 Throws of 12 Dice in Which 
a Throw of 4, 5, or 6 Points Is Considered a Success. 


SUCCESSES FREQUENCY 


=m nwo mowonwuahd = 
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Mills! gives the results of fitting a normal frequency curve 
to Weldon’s distribution of 4096 throws of 12 dice, described by 
Yule,? in which a throw of 4, 5, or 6 points was considered a 
success. If eaci: frequency, calculated from the fitted curve, is 
divided by 100 and the results rounded off to whole numbers, 
the frequency distribution given in Table I is obtained. 

If hypothetical errors of measurement are substituted for 


TABLE II 


Ideal Frequency Distribution of 41 Hypothetical Percentage 
Errors of Measurement. 





ERROR 
(Per cent of quan- 


FREQUENCY 


tity measured ) 


=m HMM e~owO oN hd = 


1Mills, F. C. Statistical Methods Applied to Economics and Business. 
Henry Holt & Co., New York (1924). 

2Yule, G. Udny. Introduction to the Theory of Statistics. Charles Griffin 
& Co., Ltd., London (1927). 





W. A, HENDRICKS 463 


successes in this frequency table, the resulting distribution may be 
considered to represent a distribution of random errors of meas- 
urement which might be made in obtaining a series of 41 meas- 
urements of a variable. The most probable error should obviously 
be zero. If the total range in magnitude of the errors is assumed 
to be from + 8 per cent to -8 per cent and the precision of meas- 
urement is such that each error differs from the next larger or 
smaller error by 2 per cent, the distribution of these hypothetical 
errors’of measurement should be as given in Table II. 

From the simple equation, Y= 100 X* 41 values of Y were 
calculated, using values of X from 1 to 41, inclusive. Each 
calculated value of Y was then changed by algebraically sub- 
tracting the hypothetical errors of measurement given in Table II. 
All the percentage errors of each magnitude were arbitrarily dis- 
tributed as uniformly as possible throughout the data. These altered 
values of Y will hereafter be termed the “observed” values and 
the original values, from which they were calculated, the “true” 
values. The observed values of Y , together with the true values 
and the assumed errors of measurement from which they were 
calculated, are given in Table III. 

In order to be certain that the errors were actually distributed 
in such a manner that the probability of the occurrence of a per- 
centage error of any given magnitude was the same at all values 
of X , the writer employed Pearson’s method of square contin- 
gency as described by Yule.’ A 16-cell contingency table was 
constructed in which the percentage errors were classified accord- 
ing to the values of X at which they occurred. The chi-square 
test for contingency was applied to this table. 

Table IV shows the actual distribution of the percentage 
errors, together with the corresponding theoretical frequencies. 
Since there are 4 rows and 4 columns of cells in the table, the 
number of algebraically independent differences between theoret- 


1Loc. cit. 
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ical and observed frequencies is (4-1)(4-1)+#1 or 10. The 
value of X . calculated from the data in Table IV, is 1.3171. The 
corresponding value of , which is the probability that as bad, 
or worse, an agreement between observed and theoretical fre- 
quencies could occur from the fluctuations of random sampling 
is, according to Pearson’s Tables," 0.996911 or almost certainty. 
The percentage errors were, therefore, distributed in such a man- 
ner as to be uncorrelated with the values of X at which they 
were used. 

The equation, Y=AX @ | was fitted to the hypothetical set 
of data in Table III by the method of least squares as ordinarily 
described. If AX* represents a calculated value of the depen- 
dent variable and Y represents the corresponding observed value, 
the difference between these two values is AX*- Y and the 
square of the difference is A*X*-2AX*Y+ Y*. The sum 
of the squafes of all the differences is A*2Z.X*ZAZXY +S Y~. 
The value of this expression will be a minimum when its deriva- 
tive with respect to A is equal to zero. Differentiating and 
equating to zero yields the following equations for the determina- 
tion of A: 


(1) 2AZX*-2ZX*Y =O 


(2) A-ZX*y 
=x 


The value of A calculated from the data in Table III by 
means of equation (2) is 100.6250. 

If residuals of the type suggested by Pearl and Reed are 
employed, A is calculated as follows. Let .4X* represent 
a calculated value of the dependent variable, as before, and let Y 
represent the corresponding observed value. Then the difference 
between the two values, expressed as a fraction of the observed 


1Pearson, Karl. Tables for Statisticians and Biometricians. Cambridge 
University Press, London (1924). 
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TABLE IV 


Chi-square test for contingency applied to the distribution of the 
percentage errors of measurement. The theoretical frequencies 
for each compartment are given in parentheses. 


Magnitude of Error (Per cent) 


0.0to +19 {22.0tot3.9 |44.0to+5.9 peeeee pera 


11 to 20 2 4 3 1 
(2.1951) | (3.9024) | (2.4390) | (1.4634) 
21 to 30 2 
(2.1951) | (3.9024) | (2.4390) | (1.4634) 
31 to 41 4 
(2.4146) (4.2926) (2.6829) | (1.6097) 


0.996911 
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value, is ax-x or ay - 1. The square of this relative 
deviation is Ay - 24x" +1 and the sum of the squares of the 
41 relative deviations is AL Ee - ~2AZ *= + 41. This expres- 
sion will likewise have its minimum value when its derivative with 
respect to .A is equal to zero. Differentiating and equating te 
zero, as before, leads to the following equations for the deter- 
mination of A : 
3 £ 


(3) 2As Sz-25 =O 


2 
(4) a= 
ive 

Applying equation (4) to the given set of data gives a value 
of 99.7573 for A. This value of A is closer to the true value 
100, than the value which was calculated by means of equation 
(2) but the improvement was not as great as might be expected. 

It occurred to the writer that if the deviations of the cal- 
culated, from the observed, values of the dependent variable were 
expressed as fractions of the calculated values, a more atcurate 
value of A could be obtained. ante 

The relative deviation expressed in this manner is AX* 
or {- -A¥. The square of this deviation is /- aay. at 
and am . sum of the squares of the 41 aati. dietitians is 


2 
Al- 2A'zie 14°*E $x. Differentiating this expression with 


respect to.4 and equating to zero yields the following equations 
for the determination of A : 


7 2 
(5) 2A *2%4-2a°s hee 


ode 


(6) A= 
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The value of A, calculated from the data by means of equa- 
tion (6), is 100.1210 which is nearer to the true value than either 
of the values calculated by the two preceding methods. However, 
it is evident that equation (6) failed to give results as precise as 
one would expect, in view of the method by which the observed 
values of Y were obtained. 

The reason for this discrepancy can be made most apparent by 
returning to the analogy existing between the. application of the 
method of least squares to curve fitting and the calculation of the 
arithmetic mean of a number of measurements of a single, con- 
stant quantity. 

Let 777,. 77%. 773. . . . 77, vepresent measured values 
of the same constant quantity and let their arithmetic mean be 
represented by 47. If each measurement is divided by the arith- 
metic mean of all the measurements, the resulting distribution of 
these relative values will be normal if the original measurements 
were distributed normally. The arithmetic mean of these relative 
values will obviously be unity. 

Let HF, ze, 7. — airepresent the relative values of 
the measurements. The arithmetic mean of these values is unity. 
Therefore, the deviation of any relative value, B, from the 
mean is 1 - FG 

Let it be assumed that the value of the arithmetic mean of 
the original measurement, /7, is unknown and is represented 
by Z . Then any measurement, 77, expressed as a fraction of 
Z ,is 3 . According to the discussion in the two preceding 
paragraphs, it might appear that 2 must have such a value that 
the sum of the squares of the deviations, 1 - Zz, is a minimum. 
However, this is not the case. It may be demonstrated that the 
value of the expression  (/f -<7 + ad J, is a minimum when 
Z has some other value than the arithmetic mean of the original 
measurements. The sum of the squares of the residuals may be 
written, 7-22 “2 m+Z°Z m* Differentiating this expres- 
sion with respect to Z and equating to zero yields the following 





W .A. HENDRICKS 


equations for the determination of Z : 
ait . 
(7) 22° m-22Z EF m*=0 


_ 2 
(8) 2-Fm 


The value of Z , calculated by means of equation (8), is 
obviously not the arithmetic mean of the original measurements. 
The fallacy in the deduction of this equation is readily apparent. 

Instead of using residuals of the type, /- 7 , and differ- 
entiating the sum of the squares of the residuals with respect to 
Z , one should use residuals of the type, V- 3 , in which V 
represents the arithmetic mean of the relative values, 3, of the 
measurements. The sum of the squares of the residuals should 
be differentiated with respect to V.. The square of the residual, 
V- ¥ , is V~- oF By and the sum of the squares of all 
the residuals may be written nVv*~ err ™ +z ne* 
Differentiating with respect to V and equating to zero yields 
the following equations for the determination of V : 


(9) 2nV-FZ£m<0 


(10) V- £Zm 
£-— 


Since the value of V is known to be unity, equation (10) 
may be written: 


(11) =-£2£m 


from which Z may be readily calculated as follows: 
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(12) z-o 







Equation (12) is obviously nothing more than the simple 
formula for the calculation of the arithmetic mean of the orig- 













inal measurements, which is sufficient evidence that the reasoning 
involved in its deduction is sound. 

It is now readily apparent why equation (6) did not yield 
results which were consistent with the data in Table III. The 
ratio, , is analogous to the ratio, z, and residuals of the 
type, V- ar , Should have been used in fitting the equation 
instead’ of oe of the type, | “AXE, . The square of the 
residual, V~ ,is V*- + . The sum of the 
squares of he 41 residuals is 4 V2. BvD Ly +#Z xa 
Differentiating this expression with respect to V and equating 
to zero yields the following equations for the determination of V : 


(13) 





(14) 


Substituting the known value, unity, for V in equation (14) 
yields the following equations for the determination of A . 













(15) 41-27%, 


(16) A= Ze 
41 








ER Tae per cer ee 2 
1Residuals of the type, as - 1, are analogous to those of the type, 
= - 1, which also lead to incorrect results. 





W .A. HENDRICKS 471 


Applying equation (16) to the datain Table III gives A a 
value of 100.0000, which coincides exactly with the true value from 
which the data were originally calculated. Equation (16) was, 
therefore, the correct equation to use in interpreting the data given 
in Table III. Although the use of residuals of the types, ae - 1 
and 1 - = , gave better approximations to the true values of 
A than the use of the simple residuals, AX Y, neither of 
the two gave results which were entirely in accord with the deriva- 
tion of the data. 

Yule! suggested that the geometric mean might often prove 
useful in comparing the frequency distributions of different sets 
of data, in which the dispersion of the individual measures about 
their means was influenced by the magnitude of the means. It 
appeared to the writer that the use of residuals of the type, 
log AX: = log Y, might give a good approximation to the true value 
of A in fitting the given equation. It is evident — the ratio, 


as , approaches unity as the residual, log Ax? log Y , ap- 
proaches zero. 


This logarithmic residual may be written, log 4+2 log x- 
log -Y , and its square is (log A * * A(logX)* + (log yy 
+ 4(log_A ) (log X ) - 2(log A ) (log Y ) ~ 4(log X ) (log Y ). 
The sum of the squares of the 41 residuals is 41(log A )* 
+4Z (log X fo + XZ (log Y)* + 4(log A )Z (log X ) 
- 2(log A) & (log Y) - 4Z (log X .log Y ). Differentiat- 
ing this expression with respect to log A and equating to zero 
yields the following equations for the determination of A : 


(17) G2AogA)+ 4Z fog X)-ZE (log Y) =O 


(log Y) - 22 (log X) 


(18) tog A= = 
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The value of logA , calculated from the given set of data by 
means of equation (18), is 1.9997369, which gives A a value 
of 99.9394. This value of A comes closer to the true value than 
those calculated by means of residuals of the types, 1- zy 
and ~y - 1. However, since the use of the geometric mean 
is not rigorously justified when the distribution of the measures 
about the arithmetic mean is symmetrical, the use of logarithmic 
residuals in curve fitting can not give precise results when the 
errors of measurement are distributed as they were in the given 
set of data. 

In any application of the method of least squares to a prac- 
tical problem, the procedure of the investigators should be gov- 
erned by the nature of the data to which it is being applied. In 
many instances the correct procedure can be deduced by a careful 
consideration and evaluation of the accuracy of the methods of 
measurement used in obtaining the data. Unfortunately, however, 
some sources of error are not always readily apparent at the time 
the data are collected, and occasionally can not be quantitatively 
estimated even though they are known to exist. If the nature of 
the mathematical relationship existing between the dependent and 
independent variables is known, all that remains is to find the 
most probable values of the constants in the equation. 

A statistical study of the deviations of the observed values 
of the dependent variable from the corresponding calculated val- 
ues, obtained after fitting the equation by several different meth- 
ods, may be of much help in deciding which method of fitting was 
most consistent with the nature of the data. For example, Table 
V gives the results of applying ‘the chi-square test for contingency 
to the distribution of the deviations of the observed values of Y 
from the calculated values obtained when residuals of the type, 
AX*-Y | were used in fitting the equation, Y=AX 7 to the 
data in Table III. The value of F is only 0.005061 and a mere 
inspection of the table itself shows that large deviations tend to 
occur more frequently, and small deviations less frequently, as 
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TABLE V 


Chi-square test for contingency applied to. the distribution of the 
deviations of the type, AX *_Y. The theoretical frequencies 
for each compartment are given in parentheses. 


Value of Magnitude of Deviation 


Xx 
10 to +2000to | +4000to | +6000and 
+ 1999 + 3999 + 5999 over Total 
10 0 0 0 


10 


1 to 10 


(7.3171) | (1.2197) | (0.9756)| (0.4878) 


11 to 20| 10 0 0 
(7.3171) | (1.2197) | (0.9756)| (0.4878) 


21 to 30| 6 3 0 
(7.3171) | (1.2197) | (0.9756)| (0.4878) 

31 to 41 1 2 
. (1.3415) | (1.0732)] (0.5366) 


E Acctalibeanal 


xX? = 23.5989 
7’ = 10 
P = 0.005061 
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the values of X increase. If the true nature of the values of 
Y in Table III were not known in advance, this distribution of 
the deviations would be sufficient evidence that the method of 
fitting the equation was not consistent with the accuracy of the 
measurements made when the data were collected. 

Tables VI, VII, and VIII give, respectively, the distributions 
of the deviations of the types, AS » ie aye , and 
log AX < log Y , when the corresponding residuals were used 
in fitting the equation.! The value of P is high in each case, in- 
dicating that, although the use of residuals of these types did not 
give results which were precisely accurate, nevertheless, they 
yielded values of A’ which were well within the limits of the 
probable error to be expected in any practical investigation. 






























As a matter of fact, this is a rather fortunate circumstance, 
since the only method of fitting the equation given above which 
yielded exactly the correct value of A cannot be applied to 
fitting an equation containing more than one undetermined con- 
stant. The applicability of residuals of the types, 1 - x and 
log £(X)- log Y is also somewhat limited. However, any 
equation which can be- fitted by the method of least squares at 
all can still be fitted when residuals of the type, LS —1, are 
employed. 


SUMMARY AND CONCLUSIONS 





The method of least squares can be a more valuable tool in 
statistical work when the furidamental theory upon which the 
method is based is taken into consideration. The use of residuals 
of the type, *(X)-Y, is probably justified in fewer practical 





1The distribution of the deviations obtained when the equation was fitted 
to the data by means of equation (16) is identical with the distribution of 
the errors given in Table IV. 
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problems than the use of residuals of some other form. The type 
of residual to be employed should be governed by the nature of 
the data to which the method of least squares is being applied. 

The use of relative residuals of the type suggested by Pearl 
and Reed may be of much value in many instances but will not 
give results which are precisely accurate, even though the dis- 
tribution of the percentage errors of measurement is strictly. nor- 
mal. The results can be improved by expressing the deviations 
of the observed from the calculated values of the dependent vari- 
able as fractions of the calculated, rather than the observed, value.? 

The use of logarithmic residuals may give more accurate 
results than the use of residuals of the type suggested by Pearl 
and Reed, even though the distribution of the percentage errors 
of measurement is normal. 

The chi-square test for contingency may be of much help in 
selecting the type of residual most consistent with the errors of 
measurement made in obtaining the data when sufficient informa- 


tion regarding the accuracy of the measurements is not available. 


1Residuals of this type have been used by Hendricks, Lee, and Titus at the 
U. S. Animal Husbandry Experiment Farm, Beltsville, Maryland, in the 
fitting of growth curves. 

Hendricks, W. A., A. R. Lee, and H. W. Titus. Early growth of White 
Leghorns, Poultry Sci. 8 (6); pp. 315-327 (1929). 

Titus, H. W., and W. A. Hendricks. The Early Growth of Chickens as a 
Function of Feed Consumption Rather Than of Time. Conference Papers 
of the Fourth World’s Poultry Congress, Section B (Nutrition and Rear- 
ing): pp. 285-293 (1930). 

The use of such residuals leads to results which appear to give a better 
description of the data than when simple residuals of the type, f(. X) -Y ‘ 
are employed. 
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TABLE VI 


Chi-square test for contingency applied to the distribution of the 
deviations of the type, a“— ~ 1. The theoretical frequencies 
for each compartment are given in parentheses. 


Value of Magnitude of Deviation 
+ 0.019 + 0.039 + 0.059 over Total 
1 to 10} 4 3 2 1 10 
(4.1463) | (3.1707) | (1.7073)| (0.9756) 
11 to 20|) 4 4 1 1 10 


(4.1463) | (3.1707) | (1.7073)| (0.9756) 


21 to 31) 4 3 1 2 10 
(4.1463) | (3.1707) | (1.7073)} (0.9756) 

31 to 41) 5 3 3 0 11 
(4.5610) | (3.4878) | (1.8780)| (1.0732) 


2 





X~= 3.8182 
7’ = 10 
P = 0.921027 
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TABLE VII 


Chi-square test for contingency applied to the distribution of the 
deviations of the type, 1 — ant . The theoretical frequencies 
for each compartment are given in parentheses. 












Value of 


Xx 


Magnitude of Deviation 


+ 0.020 to t 0.040 to + 0.060 and 
+ 0.039 Ravine 059 over 


(1.7073) | (0.9756) 
4 3 2 1 
(1.7073) | (0.9756) 
4 3 1 2 
(1.7073) | (0.9756) 
° 4 2 0 
(1.8780) | (1.0732) 





31 to 41 





oa 
uu 
er 
a 
S 
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TABLE VIII 


Chi-square test for contingency applied to the distribution of the 
deviations of the type, log AX 2 log Y . The theoretical fre- 
quencies for each compartment are given in parentheses. 


Value of Magnitude of Deviation 


x 0.000 to + 0.010 to + 0.020 to + 0.030 and 
+0.009 +0.019 +0.029 over 


1to10| 6 2 2 0 
(6.0976) | (2.4390) | (0.9756) | (0.4878) 


11to20| 6 3 0 
(6.0976) | (2.4390) | (0.9756) | (0.4878) 

21 to 30) 6 2 1 
(6.0976) | (2.4390) | (0.9756) | (0.4878) 

31to41| 7 3 1 
(6.7073) | (2.6829) | (1.0732) | (0.5366) 

4 


2 


X*= 4.4989 
7’ = 10 
P = 0.872945 








EDITOR’S NOTE 


It is with great pleasure that the Annals brings to its readers 
information concerning the Nordic Statistical Journal, edited by 
Dr. Thor Andersson. This publication is of great merit, and the 
work ‘of its contributors compares very favorably with that found 
in Biometrika and Metron. Americans will do well to study care- 
fully the contributions which Scandinavians are making to statis- 
tical methodology. 
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Nordisk Statistisk Tidskrift started in 1922. It is chiefly written 
in Nordic tongues. There are also published articles in English 
and German. To some articles in Nordic there are summaries 
in English or German. Now the chance is taken to realize the 
original scheme of publishing two editions, one in Nordic tongues 
and the other in English. The edition in non-Nordie tongues 
is published in English also because of the fact that the millions 
of descendants of the Nordic peoples, now living beyond the 
boundaries of the Nordic states, are mainly working in English- 
speaking countries. 


Nordic Statistical Journal has five departments: articles, reviews 
of books, minor communications, bibliographical lists of Nordic 
statistics, and recent periodicals and new books. In general, all 
departments will be represented in every number. 

Nordic Statistical Journal is published quarterley, the four 
numbers making a volume of about 640 pages. The subscription 
rate for a volume — post free — is 30 Swedish crowns. 

Subscriptions may be sent to Nordic Statistical Journal, Stock- 
holm, Sweden. 

The subscription rate through booksellers is 35 Swedish crowns. 

Editorial communications and all publications should be adressed 
to THOR ANDERSSON, Dr. Ph., Stockholm, Sweden. 


Aftonbl. tr., Sthim 1929. 
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Nordic Statistical Journal. Volume 1. Edited by THOR ANDERS- 
SON. Stockholm 1929. Pp. 639. Reviewed by Dr. phil. CARL 
BUKRAU. 


We, the inhabitants of the Nordic countries, are perhaps 
somewhat inclined to take a certain inner pride in our — 
as it seems to us — high civilization and to attach still more 
importance to ourselves in this respect during the later years, 
when the ’Ragnarék” of the great war had devastated most 
of the other civilized countries and handicapped them in their 
competition with us. Let us hope that there are some good 
grounds for our selfsatisfied opinion! It is not difficult to 
find some facts indicating that we are right in this self-respect, 
even if we go to the very summits of civilization — let us 
think of the "Acta matematica”’, for instance. But if we 
are right, it may be very necessary for us to be on our 
guard against the danger of stagnation, of the standstill, 
where we begin to lull ourselves into the pleasant dream that 
our position is unshakeable, and that we may now repose on 
our laurels. Therefore, we must honour the persons who do not 
allow us to go to rest, the persons who spur us on to do our 
very best. 

Thor Andersson is one of those whom we must honour for 
such an influence. In the field of statistics he seeks to be our 
scientific conscience. He swings his whip over our heads 
mercilessly and drives to activity everybody who is able to 
produce something, however small or great, within the field 
of statistics. But he is not content with that! He is not con- 
tent with the achievement of having filled a long and im- 
posing row of volumes of the "Nordisk Statistisk Tidskrift” 
with valuable essays and treatises written by Scandinavian 
as well as by leading foreign authors — all the non-scandi- 
navian countries are now to see and feel the warmth of the 
light from the North. His journal is now to become an inter- 

485 








466 NORDIO STATISTICAL JOURNAL 


national publication, but still with an indication of its Nordic 
origin in its title. The first volume of the "Nordic Statistical 
Journal” — simultaneously forming the 8th volume of the 
original journal — has appeared. And it is not a trifling thing, 
this volume of 639 pages in great octavo! It is great in its 
composition, still more soaring in its purposes and ends for the 
future, and promising, when we consider what "the man at 
the wheel” has collected in these 639 pages by means of an 
unusual perseverance in unfailing love for the task and in 
spite of many — too many — external adversities. 

The leading thought of the work is the same as, now soon 
a decennium ago, led Thor Andersson to found the Nordisk 
Statistisk Tidskrift. It is a child of the Greeks’ idea of 
chaos and cosmos, or rather a consequent, modern continua- 
tion of this idea. Statistics is the most important means for 
bringing our existence over from chaos to cosmos. Statistics 
acquaints us with the real circumstances, and the knowledge, 
the real knowledge of the things, will then show how to 
bring things in their right places, so that the entirety becomes 
the arranged cosmos. But there is still much to do! We have 
not yet been able to elevate statistics to the rank of an observ- 
ing natural science it should have, to be able to give us 
the real science of the things, alluded to above. In 1922 the 
thought was to be in the front-rank in the work for this pur- 
pose. And we have to be obliged to Thor Andersson for the 
strenuous work he has performed for his idea during the past 
years, and now it will be done on a still broader basis, i. e. 
for an international public, yet under Nordic leadership. 


Let. us study a little more closely how this new volume 
I seeks to perform its work in the service of the mentioned 
idea. 

With, in a good meaning, a journalistic feeling for actualities 
the volume appears as a sort of jubilee-gift to Bortkiewicz 
on his sixtieth anniversary and it is therefore opened by 4 
good full-page picture of this scientist who has given sv 
valuable contributions to the original journal. To the reade:, 
the following essays seem to arrange themselves into three 
groups which — just in order to give a name to the special 
groups — could be designated’ as olden times, present times, and 
Suture times. 
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In the group belonging to the olden times, the editor seeks 
to show how deeply rooted the statistical science is in the 
Nordic peoples by introducing a number of great men of 
Nordic origin, each in his way, a pioneer. These men are 
represented partly in full page pictures, partly in the text. 
It may not surprise us that none of them is a "professional 
statistician”, for the profession is only now being created. 
But they belong, each in his way, to the founders of this 
branch. In the eightteenth century Wargentin, the astronomer, 
founded population statistics which is of fundamental im- 
portance to demographics. The essay on him is particularly 
well written by Nordenmark. 

The memory of the now nearly forgotten Eilert Sundt, who, 
by his activity as a clergyman, was brought to make scienti- 
fical investigations of the society where he lives, and who 
gradually becomes a social-statistician of high rank, is re- 
vived both by a reprint of his peculiar essay of 1858: "On 
Piperviken and Ruselokbakken (Investigations of the condi- 
tions and morals of the working-class in Christiania)” and 
by a scientific estimation of him (’Eilert Sundt’s law’) by 
Rygg who also gives an instructive account of how Sundt 
was disfavoured by his contemporaries, naturally in the first 
place by the politicians who had to do with the granting of 
money for his investigations! Unfortunately the politicians 
of the present times are not better; about that Thor Andersson 
himself could write a sad chapter! 

Then follows Thiele, whose principal scientific passion, "the 
theory of observations”, is simply the foundation of what is 
now more generally called mathematical statistics, and finally 
Johannsen, the investigator of heredity, who is commemorated 
by a picture as well as by a reprint and a translation into 
English of his contribution to the first volume of the original 
journal: "Biology and statistics”. 

Several other essays like those mentioned, also belong to 
the olden times. Thus H. Palmstréms’s essay on the first census 
in Norway in 1769 and that of Thorstemsson on the census of 
Iceland in 1703. 


The essays which the reader naturally refers to the "present 
times” are evidently caused by the editor in order to 
show the non-Scandinavian world the conditions in the 
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Nordic countries in two respects, both extremely important 
from a statistical point of view: the population registration 
and the industries, thus, firstly, how we gain our knowledge 
about the number and the composition of the population, 
and, secondly, how these people support themselves. 

The editor could not have found any person more fit to 
write the general” article about population registration than 
Amnéus, the director of the Oslo population register, whose 
institution is up to the standard and also has served as a 
model in many places, among others in Denmark. The condi- 
tion of these matters in the different countries is further 
treated by the editor as far as Sweden is concerned, and as to 
Denmark by not less than two authors, Bonde and Dalgaard, 
and with regard to Finland by Kovero. These are very 
instructive essays which illustrate the importance of these 
things in the right way. One learns how even the "torso” 
(a not unjustified epithet for the arrangement incroduced 
in Denmark, which was originally excellently planned, 
but which has been more than half-way broken to pieces by 
smallminded and short-sighted politicians, of course under the 
pretext of economy) of a population register, as that of Den- 
mark, thanks to ti fact that it is obligatory, gives an ex- 
cellent support in many ways, among others for the 5-year 
censuses. One learns how deplorably far behind matters are in 
Wargentin’s native country, where it was naturally necessary 
to perform registration in the large towns, but how the 
acclomplishment of the work is hazarded by the rather an- 
tiquated and burdensome obligatory collaboration with the 
clergy — and by still many other things. A survey like this, 
presented to an international audience is perhaps more than 
anything else suited to advance the population register mo- 
vement, which shall, however, once triumph by its inner 
necessity. 

Next the cditor has intended to give a picture of the 
industrial statistics of the Nordic states. He has himself 
undertaken to treat the most important part: "the mother 
industry”, agriculture. 

Aage I. C. Jensen treats fishery, Skdien shipping, Geijer the 
ore resources of the Nordic countries, Velander the water 
powers of the Nordic countries, and, forestry, finally, is treated 
by Aminoff (Sweden), Sandmo (Norway), and Cajander (Finland). 
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To the "present times” we may also count Storsteen’s, to us, 
the inhabitants of the Nordic capitals very interesting article 
on "The expenses of living in the Nordic capitals”, a subject 
full of pitfalls for a less experienced statistician, but here 
treated with excellent fineness and with a clear prescience of 
the difficulties. 


To the same group belongs Thorberg: Statistics and trade- 
union movement’, a rather short but extremely interesting 
essay, not least on account of the author’s position as 
president of the national organisation of the Swedish trade- 
unions. Here fall the weighty words about the social-political 
institution erected by the League of Nations for internatio- 
nal labour organisation, that "the work of this organization 
is rendered extremely difficult by the fact that it has hitherto 
been almost impossible to arrive at any comparability between 
the statistics of the different countries”. 

Finally there is Linder’s: "Some remarks on the income 
statistics of the census in 1920”, which, according to its title, 
seems to belong to the present times, but which, according to 
its contents, is in the first place, a scientifical arithmetical 
example for the illustration of the applicability of Pareto’s law, 
concluding in some wishes with regard to the future official 
investigations of income. This essay can therefore be said 
to form the transition to the last group. 


When I have permitted myself to designate the third group 
of essays as future”, there may thereby, as a matter of fact, 
not be understood any paradoxical possibility of prophesying 
the statistics of the future. I have only wished to emphasize 
the editor’s desire that his journal may also be one of the 
laboratories where the instruments for the treatment of the 
future statistics are created. This side of the matter has al- 
ways had the editor’s supreme interest; you may think only 
of the contributions to the previous volumes, which Bortkie- 
wicz, Tschuprow and others have brought. We can call this 
side the theoretical or perhaps the mathematic-statistical one. 
It has been an urgent need in this volume I to show, that 
also we, in the Nordic countries think of this side of the 
matter, and among the authors of the six essays of which 
this group consists (besides the above mentioned contribution 
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by Linders) we also find all the four Nordic countries repre- 
sented. 
ithough the chief importance of treatises of this kind 
would seem to fall within the realms of theory, still one of 
this essays, namely Nybdlle’s "Interpolation in statistics’ is 
of rather eminent practical importance and use. Heside — 
or rather because of — his clear and sharp differentiation bet- 
ween the purely mathematical and the statistical meaning of 
the word interpolation and the very near connection of the 
last mentioned notion with that of adjustment, he here gives 
exclusively practical advice and instructions useful in cir- 
cumstances which, so to say, belong to the every day life 
of the statistician. This treatise will be very welcome to 
many colleagues. In some opposition hereto stands Ragnar 
Frisch: "Correlation and scatter in statistical variables”, in 
size as well as in importance one of the biggest treatises of 
the volume which will prick the conscience of many as it 
will make them clearly feel the obligation to penetrate more 
deeply into its contents -— the author himself namely tells us 
that he has come "to various results, some of which are 
known, and others which are new, so far as I am aware” — 
but will easily feel frightened by the author’s imposing mathe- 
matical apparatus, which is nothing less than n-dimensional 
vectors and appertaining matrices and orthogonal transforma- 
tions. Bui one ought not to be frightened by these heavy im- 
plements. And this so much the less as the author presupposes 
no elementary knowledge in the field of vector calculation. 
On the contrary, he explains his whole apparatus thor- 
oughly. There is no need of having heard words as vector or 
orthogeral before, and one will still be able to study this 
work, which, on this account has naturally become somewhat 
extensive (67 pages). It may be greeted with great satis- 
faction that one thus begins to attack the problems of 
theoretical statistics with such weapons. Terms such as 
scatter and correlation are so fundamental to science that we 
cannot take into use an apparatus precious enough to attack 
the problems contained in these terms. Here the best is 
not too good. 
Further we meet Jerneman: "To the method of sampling” 
and Lindeberg: "Some remarks on the mean error of the per- 
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centage of correlation”, essays well suited to waken respect 
for Nordic science abroad. 

The contact with the kindred science, social economics, is 
in the volume attended to by Gjermoe: "The amplitude of 
industrial fluctuations”, an essay covering 63 pages, which 
struggles with the difficult and not yet very well defined 
notions: times of ascent and descent, crises, fluctuations, and 
so on. The notion of "trend”, so prominent in all the ultra 
modern investigations, belongs to the here used apparatus, 
and in a quotation — this essay is very abundant in quota- 
tions, which will be praiseworthy — we learn that it was 
already found by Hooker in 1901 and was defined by him as 
"the direction in which the variable is really moving when 
the oscillations are disregarded”. 

As rather specially belonging to the "future” we meet fi- 
nally Angstrém’s brilliant essay on meteorology and statistics 
and have a presentiment that the latter is destined to play 
once the principal réle before the former. 


We will not end this review without another congratulation 
ta Thor Andersson for having brought forth this volume I, 
followed by the wish and hope that his ideal struggle for the 
highest aims will meet with the wished for success before he 
grows too tired to fight against adversity. It is sadly known 
that the Swedish Riksdag has not granted him the necessary 
subsidy in spite of the fact that such recommendations as the 
following one could be appended to the petition: 

"By the way in which Dr. Thor Andersson has edited Nor- 
disk Statistisk Tidskrift, he has, according to our opinion, 
rendered great services towards the advancement of scienti- 
fical statistics and towards the spread of the knowledge of 
its extraordinary importance, not only for other sciences, 
but also, and not least, for the obtaining of a real know- 
ledge of the social and economical structure of society, a 
knowledge that is necessary if the public measures of correct- 
ing social evils and of furthering industry will have the 
wished for effect. His name guarantees that the journal 
will, also in the future, hold the same prominent position as 
it now holds among publications of this kind. 

Stockholm January 17th, 1929. 


E.. Phragmén. P. G. Laurin.” 
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When not even this was of any use, one does not know what 
to do. Judging by former experience, it would be of still 
less use to refer to the fact that this journal is an honour for 
its country (and for the Nordic states in general) as there 
is not like it in any of the foreign countries. For from such 
an argument, presented by a favourer of Eilert Sundt in the 
Norwegian Storting, the short-sighted politician Jaabaek drew 
the conclusion that it was surely so because such things 
were superfluous! The grant to Sundt was denied and he 
withdrew to a clergyship. Will the present politicians really 
dishonour themselves still more in connection with this en- 
terprise? Let us hope that the means will be found for 
carrying on this great work. 


Sthim, Aftonbladets tr., 19380 
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